Harvey API implements rate limiting to ensure fair usage and maintain service stability. Rate limits are applied per organization and are reset every minute.Documentation Index
Fetch the complete documentation index at: https://developers.harvey.ai/llms.txt
Use this file to discover all available pages before exploring further.
Rate Limits by Endpoint
| Endpoint Category | Rate Limit (requests/minute) |
|---|---|
| Assistant Completion Endpoint | 20 |
| Vault API Endpoints | 10 |
| Audit Log Endpoints | 60 |
| History Export Endpoints | 60 |
| Client Matters Endpoints | 150 |
Handling Rate Limits
When you exceed the rate limit, the API returns a429 Too Many Requests status code. To handle rate limits effectively:
- Monitor the rate limit headers to track your usage
- Implement exponential backoff when you receive a 429 response
- Space out your requests to stay within the limits
- Consider batching operations where possible to reduce the number of API calls