Tom Fenton moves from local AI concepts to hands-on tools for matching LLMs to hardware, running local chatbots with Ollama and benchmarking AI performance.
We differentiate between observers and stores. Observers wrap generative AI APIs (like OpenAI or llama-index) and track their interactions. Stores are classes that sync these observations to different ...