|
Metadata-Version: 2.4 |
|
Name: groq |
|
Version: 0.15.0 |
|
Summary: The official Python library for the groq API |
|
Project-URL: Homepage, https://github.com/groq/groq-python |
|
Project-URL: Repository, https://github.com/groq/groq-python |
|
Author-email: Groq <[email protected]> |
|
License-Expression: Apache-2.0 |
|
License-File: LICENSE |
|
Classifier: Intended Audience :: Developers |
|
Classifier: License :: OSI Approved :: Apache Software License |
|
Classifier: Operating System :: MacOS |
|
Classifier: Operating System :: Microsoft :: Windows |
|
Classifier: Operating System :: OS Independent |
|
Classifier: Operating System :: POSIX |
|
Classifier: Operating System :: POSIX :: Linux |
|
Classifier: Programming Language :: Python :: 3.8 |
|
Classifier: Programming Language :: Python :: 3.9 |
|
Classifier: Programming Language :: Python :: 3.10 |
|
Classifier: Programming Language :: Python :: 3.11 |
|
Classifier: Programming Language :: Python :: 3.12 |
|
Classifier: Topic :: Software Development :: Libraries :: Python Modules |
|
Classifier: Typing :: Typed |
|
Requires-Python: >=3.8 |
|
Requires-Dist: anyio<5,>=3.5.0 |
|
Requires-Dist: distro<2,>=1.7.0 |
|
Requires-Dist: httpx<1,>=0.23.0 |
|
Requires-Dist: pydantic<3,>=1.9.0 |
|
Requires-Dist: sniffio |
|
Requires-Dist: typing-extensions<5,>=4.10 |
|
Description-Content-Type: text/markdown |
|
|
|
|
|
|
|
[](https://pypi.org/project/groq/) |
|
|
|
The Groq Python library provides convenient access to the Groq REST API from any Python 3.8+ |
|
application. The library includes type definitions for all request params and response fields, |
|
and offers both synchronous and asynchronous clients powered by [httpx](https://github.com/encode/httpx). |
|
|
|
It is generated with [Stainless](https://www.stainlessapi.com/). |
|
|
|
|
|
|
|
The REST API documentation can be found on [console.groq.com](https://console.groq.com/docs). The full API of this library can be found in [api.md](https://github.com/groq/groq-python/tree/main/api.md). |
|
|
|
|
|
|
|
```sh |
|
|
|
pip install groq |
|
``` |
|
|
|
|
|
|
|
The full API of this library can be found in [api.md](https://github.com/groq/groq-python/tree/main/api.md). |
|
|
|
```python |
|
import os |
|
from groq import Groq |
|
|
|
client = Groq( |
|
api_key=os.environ.get("GROQ_API_KEY"), |
|
) |
|
|
|
chat_completion = client.chat.completions.create( |
|
messages=[ |
|
{ |
|
"role": "user", |
|
"content": "Explain the importance of low latency LLMs", |
|
} |
|
], |
|
model="llama3-8b-8192", |
|
) |
|
print(chat_completion.choices[0].message.content) |
|
``` |
|
|
|
While you can provide an `api_key` keyword argument, |
|
we recommend using [python-dotenv](https://pypi.org/project/python-dotenv/) |
|
to add `GROQ_API_KEY="My API Key"` to your `.env` file |
|
so that your API Key is not stored in source control. |
|
|
|
|
|
|
|
Simply import `AsyncGroq` instead of `Groq` and use `await` with each API call: |
|
|
|
```python |
|
import os |
|
import asyncio |
|
from groq import AsyncGroq |
|
|
|
client = AsyncGroq( |
|
api_key=os.environ.get("GROQ_API_KEY"), |
|
) |
|
|
|
|
|
async def main() -> None: |
|
chat_completion = await client.chat.completions.create( |
|
messages=[ |
|
{ |
|
"role": "user", |
|
"content": "Explain the importance of low latency LLMs", |
|
} |
|
], |
|
model="llama3-8b-8192", |
|
) |
|
print(chat_completion.choices[0].message.content) |
|
|
|
|
|
asyncio.run(main()) |
|
``` |
|
|
|
Functionality between the synchronous and asynchronous clients is otherwise identical. |
|
|
|
|
|
|
|
Nested request parameters are [TypedDicts](https://docs.python.org/3/library/typing.html#typing.TypedDict). Responses are [Pydantic models](https://docs.pydantic.dev) which also provide helper methods for things like: |
|
|
|
- Serializing back into JSON, `model.to_json()` |
|
- Converting to a dictionary, `model.to_dict()` |
|
|
|
Typed requests and responses provide autocomplete and documentation within your editor. If you would like to see type errors in VS Code to help catch bugs earlier, set `python.analysis.typeCheckingMode` to `basic`. |
|
|
|
|
|
|
|
When the library is unable to connect to the API (for example, due to network connection problems or a timeout), a subclass of `groq.APIConnectionError` is raised. |
|
|
|
When the API returns a non-success status code (that is, 4xx or 5xx |
|
response), a subclass of `groq.APIStatusError` is raised, containing `status_code` and `response` properties. |
|
|
|
All errors inherit from `groq.APIError`. |
|
|
|
```python |
|
import groq |
|
from groq import Groq |
|
|
|
client = Groq() |
|
|
|
try: |
|
client.chat.completions.create( |
|
messages=[ |
|
{ |
|
"role": "system", |
|
"content": "You are a helpful assistant.", |
|
}, |
|
{ |
|
"role": "user", |
|
"content": "Explain the importance of low latency LLMs", |
|
}, |
|
], |
|
model="llama3-8b-8192", |
|
) |
|
except groq.APIConnectionError as e: |
|
print("The server could not be reached") |
|
print(e.__cause__) |
|
except groq.RateLimitError as e: |
|
print("A 429 status code was received; we should back off a bit.") |
|
except groq.APIStatusError as e: |
|
print("Another non-200-range status code was received") |
|
print(e.status_code) |
|
print(e.response) |
|
``` |
|
|
|
Error codes are as follows: |
|
|
|
| Status Code | Error Type | |
|
| ----------- | -------------------------- | |
|
| 400 | `BadRequestError` | |
|
| 401 | `AuthenticationError` | |
|
| 403 | `PermissionDeniedError` | |
|
| 404 | `NotFoundError` | |
|
| 422 | `UnprocessableEntityError` | |
|
| 429 | `RateLimitError` | |
|
| >=500 | `InternalServerError` | |
|
| N/A | `APIConnectionError` | |
|
|
|
|
|
|
|
Certain errors are automatically retried 2 times by default, with a short exponential backoff. |
|
Connection errors (for example, due to a network connectivity problem), 408 Request Timeout, 409 Conflict, |
|
429 Rate Limit, and >=500 Internal errors are all retried by default. |
|
|
|
You can use the `max_retries` option to configure or disable retry settings: |
|
|
|
```python |
|
from groq import Groq |
|
|
|
|
|
client = Groq( |
|
|
|
max_retries=0, |
|
) |
|
|
|
|
|
client.with_options(max_retries=5).chat.completions.create( |
|
messages=[ |
|
{ |
|
"role": "system", |
|
"content": "You are a helpful assistant.", |
|
}, |
|
{ |
|
"role": "user", |
|
"content": "Explain the importance of low latency LLMs", |
|
}, |
|
], |
|
model="llama3-8b-8192", |
|
) |
|
``` |
|
|
|
|
|
|
|
By default requests time out after 1 minute. You can configure this with a `timeout` option, |
|
which accepts a float or an [`httpx.Timeout`](https://www.python-httpx.org/advanced/#fine-tuning-the-configuration) object: |
|
|
|
```python |
|
from groq import Groq |
|
|
|
|
|
client = Groq( |
|
|
|
timeout=20.0, |
|
) |
|
|
|
|
|
client = Groq( |
|
timeout=httpx.Timeout(60.0, read=5.0, write=10.0, connect=2.0), |
|
) |
|
|
|
|
|
client.with_options(timeout=5.0).chat.completions.create( |
|
messages=[ |
|
{ |
|
"role": "system", |
|
"content": "You are a helpful assistant.", |
|
}, |
|
{ |
|
"role": "user", |
|
"content": "Explain the importance of low latency LLMs", |
|
}, |
|
], |
|
model="llama3-8b-8192", |
|
) |
|
``` |
|
|
|
On timeout, an `APITimeoutError` is thrown. |
|
|
|
Note that requests that time out are [retried twice by default](https://github.com/groq/groq-python/tree/main/#retries). |
|
|
|
|
|
|
|
|
|
|
|
We use the standard library [`logging`](https://docs.python.org/3/library/logging.html) module. |
|
|
|
You can enable logging by setting the environment variable `GROQ_LOG` to `info`. |
|
|
|
```shell |
|
$ export GROQ_LOG=info |
|
``` |
|
|
|
Or to `debug` for more verbose logging. |
|
|
|
|
|
|
|
In an API response, a field may be explicitly `null`, or missing entirely; in either case, its value is `None` in this library. You can differentiate the two cases with `.model_fields_set`: |
|
|
|
```py |
|
if response.my_field is None: |
|
if 'my_field' not in response.model_fields_set: |
|
print('Got json like {}, without a "my_field" key present at all.') |
|
else: |
|
print('Got json like {"my_field": null}.') |
|
``` |
|
|
|
|
|
|
|
The "raw" Response object can be accessed by prefixing `.with_raw_response.` to any HTTP method call, e.g., |
|
|
|
```py |
|
from groq import Groq |
|
|
|
client = Groq() |
|
response = client.chat.completions.with_raw_response.create( |
|
messages=[{ |
|
"role": "system", |
|
"content": "You are a helpful assistant.", |
|
}, { |
|
"role": "user", |
|
"content": "Explain the importance of low latency LLMs", |
|
}], |
|
model="llama3-8b-8192", |
|
) |
|
print(response.headers.get('X-My-Header')) |
|
|
|
completion = response.parse() |
|
print(completion.id) |
|
``` |
|
|
|
These methods return an [`APIResponse`](https://github.com/groq/groq-python/tree/main/src/groq/_response.py) object. |
|
|
|
The async client returns an [`AsyncAPIResponse`](https://github.com/groq/groq-python/tree/main/src/groq/_response.py) with the same structure, the only difference being `await`able methods for reading the response content. |
|
|
|
|
|
|
|
The above interface eagerly reads the full response body when you make the request, which may not always be what you want. |
|
|
|
To stream the response body, use `.with_streaming_response` instead, which requires a context manager and only reads the response body once you call `.read()`, `.text()`, `.json()`, `.iter_bytes()`, `.iter_text()`, `.iter_lines()` or `.parse()`. In the async client, these are async methods. |
|
|
|
```python |
|
with client.chat.completions.with_streaming_response.create( |
|
messages=[ |
|
{ |
|
"role": "system", |
|
"content": "You are a helpful assistant.", |
|
}, |
|
{ |
|
"role": "user", |
|
"content": "Explain the importance of low latency LLMs", |
|
}, |
|
], |
|
model="llama3-8b-8192", |
|
) as response: |
|
print(response.headers.get("X-My-Header")) |
|
|
|
for line in response.iter_lines(): |
|
print(line) |
|
``` |
|
|
|
The context manager is required so that the response will reliably be closed. |
|
|
|
|
|
|
|
This library is typed for convenient access to the documented API. |
|
|
|
If you need to access undocumented endpoints, params, or response properties, the library can still be used. |
|
|
|
|
|
|
|
To make requests to undocumented endpoints, you can make requests using `client.get`, `client.post`, and other |
|
http verbs. Options on the client will be respected (such as retries) when making this request. |
|
|
|
```py |
|
import httpx |
|
|
|
response = client.post( |
|
"/foo", |
|
cast_to=httpx.Response, |
|
body={"my_param": True}, |
|
) |
|
|
|
print(response.headers.get("x-foo")) |
|
``` |
|
|
|
|
|
|
|
If you want to explicitly send an extra param, you can do so with the `extra_query`, `extra_body`, and `extra_headers` request |
|
options. |
|
|
|
|
|
|
|
To access undocumented response properties, you can access the extra fields like `response.unknown_prop`. You |
|
can also get all the extra fields on the Pydantic model as a dict with |
|
[`response.model_extra`](https://docs.pydantic.dev/latest/api/base_model/#pydantic.BaseModel.model_extra). |
|
|
|
|
|
|
|
You can directly override the [httpx client](https://www.python-httpx.org/api/#client) to customize it for your use case, including: |
|
|
|
- Support for [proxies](https://www.python-httpx.org/advanced/proxies/) |
|
- Custom [transports](https://www.python-httpx.org/advanced/transports/) |
|
- Additional [advanced](https://www.python-httpx.org/advanced/clients/) functionality |
|
|
|
```python |
|
import httpx |
|
from groq import Groq, DefaultHttpxClient |
|
|
|
client = Groq( |
|
|
|
base_url="http://my.test.server.example.com:8083", |
|
http_client=DefaultHttpxClient( |
|
proxy="http://my.test.proxy.example.com", |
|
transport=httpx.HTTPTransport(local_address="0.0.0.0"), |
|
), |
|
) |
|
``` |
|
|
|
You can also customize the client on a per-request basis by using `with_options()`: |
|
|
|
```python |
|
client.with_options(http_client=DefaultHttpxClient(...)) |
|
``` |
|
|
|
|
|
|
|
By default the library closes underlying HTTP connections whenever the client is [garbage collected](https://docs.python.org/3/reference/datamodel.html#object.__del__). You can manually close the client using the `.close()` method if desired, or with a context manager that closes when exiting. |
|
|
|
```py |
|
from groq import Groq |
|
|
|
with Groq() as client: |
|
|
|
... |
|
|
|
|
|
``` |
|
|
|
|
|
|
|
This package generally follows [SemVer](https://semver.org/spec/v2.0.0.html) conventions, though certain backwards-incompatible changes may be released as minor versions: |
|
|
|
1. Changes that only affect static types, without breaking runtime behavior. |
|
2. Changes to library internals which are technically public but not intended or documented for external use. _(Please open a GitHub issue to let us know if you are relying on such internals.)_ |
|
3. Changes that we do not expect to impact the vast majority of users in practice. |
|
|
|
We take backwards-compatibility seriously and work hard to ensure you can rely on a smooth upgrade experience. |
|
|
|
We are keen for your feedback; please open an [issue](https://www.github.com/groq/groq-python/issues) with questions, bugs, or suggestions. |
|
|
|
|
|
|
|
If you've upgraded to the latest version but aren't seeing any new features you were expecting then your python environment is likely still using an older version. |
|
|
|
You can determine the version that is being used at runtime with: |
|
|
|
```py |
|
import groq |
|
print(groq.__version__) |
|
``` |
|
|
|
|
|
|
|
Python 3.8 or higher. |
|
|
|
|
|
|
|
See [the contributing documentation](https://github.com/groq/groq-python/tree/main/./CONTRIBUTING.md). |
|
|