Rate Limits

Harvey API implements rate limiting to ensure fair usage and maintain service stability. Rate limits are applied per organization and are reset every minute.

Rate Limits by Endpoint

Endpoint Category	Rate Limit (requests/minute)
Assistant Completion Endpoint	20
Vault API Endpoints	10
Audit Log Endpoints	60
History Export Endpoints	60
Client Matters Endpoints	150

Handling Rate Limits

When you exceed the rate limit, the API returns a 429 Too Many Requests status code. To handle rate limits effectively:

Monitor the rate limit headers to track your usage
Implement exponential backoff when you receive a 429 response
Space out your requests to stay within the limits
Consider batching operations where possible to reduce the number of API calls

Getting Started

Assistant

Vault

History Export

Audit Logs

Client Matters

Rate Limits by Endpoint

Handling Rate Limits

Getting Started

Assistant

Vault

History Export

Audit Logs

Client Matters

​Rate Limits by Endpoint

​Handling Rate Limits

Rate Limits by Endpoint

Handling Rate Limits