> ## Documentation Index
> Fetch the complete documentation index at: https://developers.harvey.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Rate Limits

Harvey API implements rate limiting to ensure fair usage and maintain service stability. Rate limits are applied per organization and are reset every minute.

## Rate Limits by Endpoint

| Endpoint Category                 | Rate Limit (requests/minute) |
| --------------------------------- | ---------------------------- |
| **Assistant Completion Endpoint** | 20                           |
| **Vault API Endpoints**           | 10                           |
| **Audit Log Endpoints**           | 60                           |
| **History Export Endpoints**      | 60                           |
| **Client Matters Endpoints**      | 150                          |

## Handling Rate Limits

When you exceed the rate limit, the API returns a `429 Too Many Requests` status code. To handle rate limits effectively:

1. **Monitor the rate limit headers** to track your usage
2. **Implement exponential backoff** when you receive a 429 response
3. **Space out your requests** to stay within the limits
4. **Consider batching operations** where possible to reduce the number of API calls
