Skip to content

Version 1.2.0

Latest
Compare
Choose a tag to compare
@peterbanda peterbanda released this 24 Apr 11:00

1. ResponsesAPI

  • Unified function/endpoint combining chat simplicity with tool use and state management.
  • Out-of-the-box tool support - It natively supports first-party tools like web_search, file_search, and computer_use, enabling you to invoke these capabilities without additional orchestration.
  • Built-in multi-turn conversation chaining-  Use the previous_response_id parameter to link requests into a chain of turns, and the instructions parameter to inject or override system/developer messages on a per-call basis.
  • Multimodal input and output - Beyond text, the API accepts images and audio in the same request, letting you build fully multimodal, tool-augmented experiences in a single call.

2. Core and OpenAI Enhancements

  • JSON mode handling improvements and fallback json-repair implementation (port of json-repair by @mangiucugna)
  • New models: o3, o4-mini, gpt-4.1, and gpt-4.5 series
  • Web search support (gpt-4o-search-preview)
  • Chat completion parameters expanded (store, reasoning_effort, service_tier, parallel_tool_calls, metadata)
  • Streaming and non-streaming IO conversion adapters developed and enhanced
  • Token counting updated (jtokkit v1.1.0)
  • Usage analytics improved

3. Anthropic Platform Enhancements

  • Thinking and streaming settings integration
  • Claude 3.7 Sonnet (Vanilla and via Bedrock)
  • Citations handling, text blocks encoding improvements
  • Caching support
  • Enhanced token-limit error handling and mapped Anthropic to OpenAI exceptions
  • A ton of new examples (also for Vision and PDF processing)

4. Google Gemini Integration

  • New Google Gemini module and models introduced (Gemini 2.5 / 2.0 Pro and Flash)
  • Gemini JSON schema handling improved, including OpenAI wrapper integration
  • System message caching, domain content management, and usage tracking adjustments
  • Btw. Google Vertex now also supports JSON schema mode

5. Perplexity Sonar Integration

  • New Perplexity Sonar module and models introduced (sonar-deep-research, reasoning-pro, sonar-pro, etc.)
  • Sonar JSON and regex response support, and citations formatting/handling
  • OpenAI chat completion wrappers

6. Other Providers: Deepseek, Groq, Grok, FireworksAI, and Novita

  • Groq JSON handling unified and adjusted, with deepseek-r1-distill-llama-70b integration
  • JSON schema handling for Grok models
  • FireworksAI improvements (document inlining), Deepseek model integrations
  • Message conversions, filtering thinking tokens, reasoning effort examples
  • Llama 4 family
  • New Deepseek models (deepseek-r1, DeepSeek-R1 distill) across providers (FireworksAI, Groq, Together AI), plus other models such as Phi-3-vision-128k-instruct, Deepseek-v2-lite-chat, and Llama-3.3-70b
  • New chat completion provider: Novita — Welcome to the family!

7. General Project Setup and CI/CD

  • Build setup adjustments (build.sbt registrations, env helpers)
  • GitHub CI - upload-artifact version bump (to v4)
  • Example datasets added (e.g., norway_wiki dump), imports optimized
  • README extended with more examples