Today we’re releasing our Inference API that serves Nous Research models. We heard your feedback, and built a simple system to make our language models more accessible to developers and researchers everywhere.
The initial release features two models - Hermes 3 Llama 70B and DeepHermes 3 8B Preview (with more coming soon)
To ensure a smooth rollout, we implemented a waitlist system at the Nous Portal which can be found here: Nous Portal
- Access will be granted on a first-come, first-served basis
- Once granted access, you can create API keys and purchase credits
- This is an OpenAI-compatible completions and chat completions API
- Right now all accounts start off with $5.00 of free credits.
Any questions, let us know. Ongoing feedback and ideas are always welcome.
We’re going to be starting with the cheapest inference we can offer at cost-basis for Hermes models, and as we scale up users + as the infrastructure for intelligence on Psyche grows, we’ll adjust the offerings based on demand.