<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://ephemerawiki.space/index.php?action=history&amp;feed=atom&amp;title=Help%3AEphemera_Agent%2FLLM_Providers</id>
	<title>Help:Ephemera Agent/LLM Providers - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://ephemerawiki.space/index.php?action=history&amp;feed=atom&amp;title=Help%3AEphemera_Agent%2FLLM_Providers"/>
	<link rel="alternate" type="text/html" href="https://ephemerawiki.space/index.php?title=Help:Ephemera_Agent/LLM_Providers&amp;action=history"/>
	<updated>2026-05-19T12:45:42Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.42.3</generator>
	<entry>
		<id>https://ephemerawiki.space/index.php?title=Help:Ephemera_Agent/LLM_Providers&amp;diff=9&amp;oldid=prev</id>
		<title>EphemeraAdmin: Import: Help:Ephemera Agent/LLM Providers</title>
		<link rel="alternate" type="text/html" href="https://ephemerawiki.space/index.php?title=Help:Ephemera_Agent/LLM_Providers&amp;diff=9&amp;oldid=prev"/>
		<updated>2026-04-01T04:29:33Z</updated>

		<summary type="html">&lt;p&gt;Import: Help:Ephemera Agent/LLM Providers&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{DISPLAYTITLE:Ephemera Agent — LLM Providers}}&lt;br /&gt;
__TOC__&lt;br /&gt;
&lt;br /&gt;
== Supported Providers ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;width:100%&amp;quot;&lt;br /&gt;
! Provider !! Available Models !! API Key Location&lt;br /&gt;
|-&lt;br /&gt;
| Claude (Anthropic) || claude-sonnet-4-20250514, claude-opus-4-20250514, claude-haiku-4-5-20251001 || [https://console.anthropic.com console.anthropic.com]&lt;br /&gt;
|-&lt;br /&gt;
| GPT (OpenAI) || gpt-4o, gpt-4o-mini, gpt-4-turbo || [https://platform.openai.com/api-keys platform.openai.com/api-keys]&lt;br /&gt;
|-&lt;br /&gt;
| Gemini (Google) || gemini-2.5-pro, gemini-2.5-flash, gemini-2.5-flash-lite, gemini-3.1-pro-preview, gemini-3-flash-preview, gemini-3.1-flash-lite-preview || [https://aistudio.google.com/apikey aistudio.google.com/apikey]&lt;br /&gt;
|-&lt;br /&gt;
| Custom endpoint || Any model name || Varies by provider&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Custom / OpenAI-Compatible Endpoints ==&lt;br /&gt;
&lt;br /&gt;
Select &amp;#039;&amp;#039;&amp;#039;Custom (OpenAI-compatible)&amp;#039;&amp;#039;&amp;#039; from the provider dropdown, then enter:&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;API Endpoint URL&amp;#039;&amp;#039;&amp;#039; — the full URL of the &amp;lt;code&amp;gt;/chat/completions&amp;lt;/code&amp;gt; endpoint (e.g. &amp;lt;code&amp;gt;https://api.groq.com/openai/v1/chat/completions&amp;lt;/code&amp;gt;)&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Model name&amp;#039;&amp;#039;&amp;#039; — exact model string the endpoint expects (e.g. &amp;lt;code&amp;gt;llama-3.3-70b-versatile&amp;lt;/code&amp;gt;)&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;API key&amp;#039;&amp;#039;&amp;#039; — the provider&amp;#039;s API key&lt;br /&gt;
&lt;br /&gt;
Compatible with: Groq, Mistral, Together AI, Fireworks, Perplexity, and others. Also compatible with locally-hosted models via Ollama or LM Studio — expose them with a tunnel (e.g. ngrok) to make them reachable from the server.&lt;br /&gt;
&lt;br /&gt;
== Tiered Model Routing ==&lt;br /&gt;
&lt;br /&gt;
The system uses two separate LLM calls per generative task:&lt;br /&gt;
&lt;br /&gt;
; Planner tier&lt;br /&gt;
: Handles task classification and entity extraction. Should be a fast, cheap model. The input is small and the output is structured JSON — no creativity needed. Recommended: Haiku, Flash-Lite, GPT-4o-mini.&lt;br /&gt;
&lt;br /&gt;
; Generator tier&lt;br /&gt;
: Handles actual content creation. Receives the full assembled context. Use the best model available for the quality of output you need.&lt;br /&gt;
&lt;br /&gt;
Configure each tier independently in the &amp;#039;&amp;#039;&amp;#039;SETTINGS&amp;#039;&amp;#039;&amp;#039; tab. Settings persist via localStorage.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Recommendations ==&lt;br /&gt;
&lt;br /&gt;
* Use a fast, low-cost model for the Planner tier.&lt;br /&gt;
* Use the strongest available model for the Generator tier.&lt;br /&gt;
* If tool calling behaves unexpectedly, try switching providers before assuming the prompts or backend are at fault.&lt;br /&gt;
&lt;br /&gt;
== Key Storage ==&lt;br /&gt;
&lt;br /&gt;
API keys entered manually are stored in browser memory only for the duration of the session. Loading a [[Help:Ephemera Agent/Configuration#Keys File|keys file]] provides the same memory-only storage with one-click convenience.&lt;br /&gt;
&lt;br /&gt;
Keys are &amp;#039;&amp;#039;&amp;#039;never&amp;#039;&amp;#039;&amp;#039; written to disk, stored in cookies, or sent to any server other than the relevant AI provider&amp;#039;s own API.&lt;br /&gt;
&lt;br /&gt;
[[Category:Help]]&lt;br /&gt;
[[Category:EphemeraAgent]]&lt;/div&gt;</summary>
		<author><name>EphemeraAdmin</name></author>
	</entry>
</feed>