Rate limiting controls the number of requests a client can make to a service within a specified time window. It's commonly used to protect backend services from abuse and ensure fair usage.