Z.ai

All supported Z.ai models can be found here

Z.ai is an AI platform that provides cutting-edge large language models from the GLM series. Its flagship models use a Mixture-of-Experts (MoE) architecture and offer advanced reasoning, coding, and agentic capabilities.

For the latest pricing, visit the pricing page.

Resources

Website: https://z.ai/
Documentation: https://docs.z.ai/
API Documentation: https://docs.z.ai/api-reference/introduction
GitHub: https://github.com/zai-org
Hugging Face: https://huggingface.co/zai-org

Supported tasks

Chat Completion (LLM)

Find out more about Chat Completion (LLM) here.

import os
from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="zai-org",
    api_key=os.environ["HF_TOKEN"],
)

completion = client.chat.completions.create(
    model="zai-org/GLM-4.6",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ],
)

print(completion.choices[0].message)
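
The request above returns the whole reply at once. As a minimal sketch, the same call can also be streamed so that text is printed as it arrives. The helper names `build_messages` and `stream_answer` are hypothetical and introduced only for illustration; the sketch assumes `HF_TOKEN` is set and that the installed `huggingface_hub` version supports `stream=True` on `chat.completions.create`.

```python
import os


def build_messages(question, system_prompt=None):
    """Return a chat `messages` list in the role/content format used above.

    Hypothetical helper, not part of huggingface_hub.
    """
    messages = []
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": question})
    return messages


def stream_answer(question, system_prompt=None):
    """Print the reply incrementally and return the full text.

    Hypothetical helper; needs network access and a valid HF_TOKEN.
    """
    # Imported lazily so build_messages works without the package installed.
    from huggingface_hub import InferenceClient

    client = InferenceClient(
        provider="zai-org",
        api_key=os.environ["HF_TOKEN"],
    )
    stream = client.chat.completions.create(
        model="zai-org/GLM-4.6",
        messages=build_messages(question, system_prompt),
        stream=True,  # yield chunks instead of a single completion
    )
    parts = []
    for chunk in stream:
        # Some chunks may carry no choices or no text; skip those.
        if not chunk.choices:
            continue
        delta = chunk.choices[0].delta.content
        if delta:
            parts.append(delta)
            print(delta, end="", flush=True)
    print()
    return "".join(parts)
```

Calling `stream_answer("What is the capital of France?", system_prompt="Answer in one sentence.")` would send the request. Keeping the message construction in a separate function makes the payload easy to inspect without contacting the provider.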
