Provider Support
================

LLM Batch Helper supports multiple LLM providers. Each provider has specific configuration requirements and supported models.

🎉 **New in v0.3.0**: Simplified API - all examples now work without async/await!

Supported Providers
-------------------

OpenAI
~~~~~~

The OpenAI provider supports all OpenAI chat completion models.

**Setup:**

.. code-block:: bash

   export OPENAI_API_KEY="your-openai-api-key"

**Supported Models:**

- ``gpt-4o``
- ``gpt-4o-mini``
- ``gpt-4``
- ``gpt-4-turbo``
- ``gpt-3.5-turbo``

**Example:**

.. code-block:: python

   from llm_batch_helper import LLMConfig, process_prompts_batch

   config = LLMConfig(
       model_name="gpt-4o-mini",
       temperature=1.0,
       max_completion_tokens=500
   )

   results = process_prompts_batch(
       config=config,
       provider="openai",
       prompts=your_prompts
   )

OpenRouter (Recommended)
~~~~~~~~~~~~~~~~~~~~~~~~

The OpenRouter provider gives you access to 100+ models from all major providers through a single API.

**Setup:**

.. code-block:: bash

   export OPENROUTER_API_KEY="your-openrouter-api-key"

**Popular Models:**

- ``openai/gpt-4o`` - OpenAI's latest model
- ``openai/gpt-4o-mini`` - Fast and cost-effective
- ``anthropic/claude-3-5-sonnet`` - Anthropic's latest
- ``deepseek/deepseek-v3.1-base`` - High-performance reasoning
- ``meta-llama/llama-3.1-405b-instruct`` - Meta's largest model
- ``google/gemini-pro-1.5`` - Google's multimodal model
- And 90+ more models from all major providers!

**Example:**

.. code-block:: python

   from llm_batch_helper import LLMConfig, process_prompts_batch

   config = LLMConfig(
       model_name="deepseek/deepseek-v3.1-base",  # or any OpenRouter model
       temperature=1.0,
       max_completion_tokens=500
   )

   results = process_prompts_batch(
       config=config,
       provider="openrouter",  # Access to 100+ models!
       prompts=your_prompts
   )

**Benefits of OpenRouter:**

- Access to 100+ models from all major providers
- Unified API for all models
- Competitive pricing
- No need for multiple API keys
- Easy model switching

Together.ai
~~~~~~~~~~~

The Together.ai provider supports various open-source models hosted on Together.ai.

**Setup:**

.. code-block:: bash

   export TOGETHER_API_KEY="your-together-api-key"

**Popular Models:**

- ``meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo``
- ``meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo``
- ``mistralai/Mixtral-8x7B-Instruct-v0.1``
- ``NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO``
- ``togethercomputer/RedPajama-INCITE-Chat-3B-v1``

**Example:**

.. code-block:: python

   from llm_batch_helper import LLMConfig, process_prompts_batch

   config = LLMConfig(
       model_name="meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo",
       temperature=1.0,
       max_completion_tokens=300,
       system_instruction="You are a helpful AI assistant."
   )

   results = process_prompts_batch(
       config=config,
       provider="together",
       prompts=your_prompts
   )

OpenRouter
~~~~~~~~~~

The OpenRouter provider provides access to a wide variety of language models from different providers through a unified API.

**Setup:**

.. code-block:: bash

   export OPENROUTER_API_KEY="your-openrouter-api-key"

**Popular Models:**

- ``openai/gpt-4o``
- ``openai/gpt-4o-mini``
- ``anthropic/claude-3-5-sonnet``
- ``meta-llama/llama-3.1-405b-instruct``
- ``google/gemini-pro-1.5``
- ``mistralai/mixtral-8x7b-instruct``

**Example:**

.. code-block:: python

   from llm_batch_helper import LLMConfig, process_prompts_batch

   config = LLMConfig(
       model_name="openai/gpt-4o-mini",
       temperature=0.7,
       max_completion_tokens=500,
       system_instruction="You are a helpful AI assistant."
   )

   results = await process_prompts_batch(
       config=config,
       provider="openrouter",
       prompts=your_prompts
   )

Provider Comparison
-------------------

.. list-table::
   :header-rows: 1

     * - Feature
     - OpenAI
     - Together.ai
     - OpenRouter
  * - Model Variety
     - OpenAI models only
     - Many open-source models
     - 100+ models from all providers
  * - Pricing
     - Per-token pricing
     - Competitive pricing
     - Varies by model, competitive
  * - Rate Limits
     - Tier-based limits
     - Model-dependent limits
     - Credit-based system
  * - Response Quality
     - Very high (GPT-4)
     - Varies by model
     - Depends on chosen model
   * - Speed
     - Fast
     - Varies by model

Configuration Best Practices
-----------------------------

Temperature Settings
~~~~~~~~~~~~~~~~~~~~

.. code-block:: python

   # For factual/deterministic responses
   config = LLMConfig(temperature=0.0)
   
   # For balanced creativity
   config = LLMConfig(temperature=0.7)
   
   # For highly creative responses
   config = LLMConfig(temperature=1.0)

Concurrency Management
~~~~~~~~~~~~~~~~~~~~~~

.. code-block:: python

   # Conservative (good for testing)
   config = LLMConfig(max_concurrent_requests=2)
   
   # Balanced (recommended)
   config = LLMConfig(max_concurrent_requests=5)
   
   # Aggressive (for high-throughput)
   config = LLMConfig(max_concurrent_requests=10)

Token Management
~~~~~~~~~~~~~~~~

.. code-block:: python

   # Short responses
   config = LLMConfig(max_completion_tokens=100)
   
   # Medium responses  
   config = LLMConfig(max_completion_tokens=500)
   
   # Long responses
   config = LLMConfig(max_completion_tokens=2000)

Error Handling by Provider
---------------------------

Each provider may have different error conditions:

**OpenAI Errors:**

- Rate limit exceeded
- Invalid API key
- Model not found
- Token limit exceeded

**Together.ai Errors:**

- Rate limit exceeded
- Invalid API key
- Model not available
- Request timeout

The package automatically retries on transient errors with exponential backoff.

Advanced Usage
--------------

Custom System Instructions
~~~~~~~~~~~~~~~~~~~~~~~~~~

.. code-block:: python

   config = LLMConfig(
       model_name="gpt-4o-mini",
       system_instruction="""
       You are an expert technical writer. 
       Always provide clear, concise explanations.
       Include code examples when relevant.
       """
   )

Provider-Specific Optimizations
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. code-block:: python

   # OpenAI: Optimized for speed
   openai_config = LLMConfig(
       model_name="gpt-4o-mini",
       max_concurrent_requests=10,
       temperature=0.7
   )

   # Together.ai: Optimized for cost
   together_config = LLMConfig(
       model_name="meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo",
       max_concurrent_requests=5,
       temperature=0.8
   )