Streaming Off-Llama 3.1 405B
Number of APIs: 4
-
Chat (NVIDIA AI - llama-3.1-405b-instruct) POST https://integrate.api.nvidia.com/v1/chat/completions
-
Chat (Together AI - Meta-Llama-3.1-405B-Instruct-Turbo) POST https://api.together.xyz/v1/chat/completions
-
Chat (Fireworks AI - llama-v3p1-405b-instruct) POST https://api.fireworks.ai/inference/v1/chat/completions
-
Chat (OctoAI - meta-llama-3.1-405b-instruct) POST https://text.octoai.run/v1/chat/completions