SUNNYVALE, Calif.--(BUSINESS WIRE)--Last week, Nvidia announced that eight Blackwell GPUs in a DGX B200 could demonstrate 1,000 tokens per second (TPS) per user on Meta’s Llama 4 Maverick. Today, the same ...
Enterprises will be able to access Llama models hosted by Meta instead of downloading and running the models themselves. Meta has unveiled a preview version of an API for its Llama large language ...
At its inaugural LlamaCon AI developer conference on Tuesday, Meta announced an API for its Llama series of AI models: the Llama API. Available in limited preview, the Llama API lets developers ...