Getting Started
Rate Limits
Harvey API implements rate limiting to ensure fair usage and maintain service stability. Rate limits are applied per organization and are reset every minute.
Rate Limits by Endpoint
Endpoint Category | Rate Limit (requests/minute) |
---|---|
Assistant Completion Endpoint | 20 |
Vault API Endpoints | 10 |
Audit Log Endpoints | 60 |
History Export Endpoints | 60 |
Client Matters Endpoints | 150 |
Handling Rate Limits
When you exceed the rate limit, the API returns a 429 Too Many Requests
status code. To handle rate limits effectively:
- Monitor the rate limit headers to track your usage
- Implement exponential backoff when you receive a 429 response
- Space out your requests to stay within the limits
- Consider batching operations where possible to reduce the number of API calls