- Added a cost multiplier for Llama models, configurable via the LLAMA_MODEL_MULTIPLIER environment variable
- Enhanced usage tracking with automatic Llama model detection
- Deprecated discontinued Groq models (llama-3.1-70b, mistral-saba-24b, qwen-qwq-32b)
- Migrated Cerebras models to OpenRouter endpoints
- Updated model configurations in the LLM proxy
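
For illustration, the multiplier and Llama detection noted above might work roughly as follows. This is a minimal sketch, not the proxy's actual code: only the LLAMA_MODEL_MULTIPLIER variable name comes from the entries above. The helper names, the substring-based detection, and the fallback to 1.0 for missing or invalid values are assumptions.

```python
import math
import os


def llama_multiplier(env=None):
    """Read LLAMA_MODEL_MULTIPLIER, falling back to 1.0 (assumed default)."""
    env = os.environ if env is None else env
    raw = env.get("LLAMA_MODEL_MULTIPLIER", "1.0")
    try:
        value = float(raw)
    except ValueError:
        return 1.0  # assumption: unparsable values are ignored
    # assumption: only positive, finite multipliers are meaningful
    return value if math.isfinite(value) and value > 0 else 1.0


def is_llama_model(model_id):
    """Hypothetical detection rule: case-insensitive substring match."""
    return "llama" in model_id.lower()


def adjusted_cost(model_id, base_cost, env=None):
    """Apply the multiplier to Llama models only; other models are unchanged."""
    if is_llama_model(model_id):
        return base_cost * llama_multiplier(env)
    return base_cost
```

With LLAMA_MODEL_MULTIPLIER=1.5, a hypothetical call such as `adjusted_cost("llama-3.3-70b", 2.0)` would scale the base cost, while a non-Llama model ID would pass through unchanged.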