{"type":"rich","version":"1.0","provider_name":"Transistor","provider_url":"https://transistor.fm","author_name":"Machine Learning Tech Brief By HackerNoon","title":"Local LLMs Need More Than OpenAI-Compatible Endpoints","html":"<iframe width=\"100%\" height=\"180\" frameborder=\"no\" scrolling=\"no\" seamless src=\"https://share.transistor.fm/e/9326bf07\"></iframe>","width":"100%","height":180,"duration":983,"description":"\n        This story was originally published on HackerNoon at: https://hackernoon.com/local-llms-need-more-than-openai-compatible-endpoints.\n             Respawn is a stateful OpenAI Responses API gateway for local LLMs, adding stored responses, tools, streaming, files and observability to Ollama.  \n            Check more stories related to machine-learning at: https://hackernoon.com/c/machine-learning.\n            You can also check exclusive content about #ai, #llm, #open-source, #ollama, #self-hosted-ai, #api, #openai, #local-ai,  and more.\n            \n            \n            This story was written by: @robertomanfreda. Learn more about this writer by checking @robertomanfreda's about page,\n            and for more stories, please visit hackernoon.com.\n            \n                \n                \n                Local LLM servers are great at generating tokens, but modern clients expect more than inference: state, lifecycle endpoints, streaming shape, tool protocol, files, and metrics. Respawn is an open-source gateway that sits in front of Ollama/self-hosted backends and adds OpenAI Responses API semantics locally.\n        \n        ","thumbnail_url":"https://img.transistorcdn.com/KyA01h2FD2insgk-wX_xzV6vbJnTNl2BvPYVL-XaI9A/rs:fill:0:0:1/w:400/h:400/q:60/mb:500000/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9zaG93/LzQxMjcyLzE2ODM1/ODI0ODgtYXJ0d29y/ay5qcGc.webp","thumbnail_width":300,"thumbnail_height":300}