Streaming Off-Llama 3 70B

Number of APIs: 7

Chat (NVIDIA AI - llama3-70b) POST https://integrate.api.nvidia.com/v1/chat/completions
Chat (OctoAI - meta-llama-3-70b-instruct) POST https://text.octoai.run/v1/chat/completions
Chat (Anyscale - llama-3-70b-chat-hf) POST https://api.endpoints.anyscale.com/v1/chat/completions
Chat (Fireworks AI - llama-v3-70b-instruct) POST https://api.fireworks.ai/inference/v1/chat/completions
Chat (Groq - llama3-70b-8192) POST https://api.groq.com/openai/v1/chat/completions
Chat (Deep Infra - meta-llama/Meta-Llama-3-70B-Instruct) POST https://api.deepinfra.com/v1/openai/chat/completions
Chat (Lepton AI - llama3-70b) POST https://llama3-70b.lepton.run/api/v1/chat/completions