Monitoring AWS SQS

Published in

AWS in Plain English

5 min readMar 4, 2024

Amazon Simple Queue Service (SQS) is a fully managed message queuing service that enables you to decouple and scale microservices, distributed systems, and serverless applications. You can read more about SQS here:

AWS SQS: What, why, when?

Everything to know about Amazon Simple Queue Service (SQS)…

ayushblog.medium.com

Effective monitoring of SQS can dramatically improve the reliability, efficiency, and scalability of your applications. This blog post explores the essential metrics and alarms for AWS SQS to ensure your queueing system operates at its peak.

Let’s have a look

Essential Metrics

1. ApproximateNumberOfMessagesVisible

This metric indicates the number of messages available for retrieval and processing. It’s a direct measure of your queue’s workload and an essential metric for identifying backlogs.

This message gives you an idea of when to scale your consumers to effectively reduce backlog and maintain SLA.

2. ApproximateNumberOfMessagesNotVisible

Messages being processed are not visible in the queue. This metric helps you gauge the rate of message processing and the efficiency of your consumers.
This can also be used to see the number of In-flight messages.

3. ApproximateNumberOfMessagesDelayed

This metric counts the messages delayed from immediate delivery i.e. messages sent to the queue with a delay. Monitoring this can help you manage and optimize message delivery timings.

4. NumberOfMessagesSent

The total number of messages added to the queue provides insight into the production rate, which is crucial for scaling purposes.

5. NumberOfMessagesReceived

The count of messages retrieved for processing, which, when compared with the number of messages sent, offers insights into the overall throughput of your system.

6. NumberOfMessagesDeleted

Tracks the number of successfully processed and deleted messages, indicating the health of your message processing.

7. SentMessageSize

Monitoring the size of your messages can help in optimizing performance and managing costs.

8. ApproximateAgeOfOldestMessage

This metric represents the time in seconds that the oldest message has been in the queue. It provides invaluable insight into the processing throughput and potential bottlenecks within your messaging system.

A high value indicates that messages are waiting too long to be processed. This could signal that your consumer processes are unable to keep up with the incoming message rate or that there is an issue preventing messages from being processed efficiently.

Essential Alarms for SQS

Implementing alarms based on the metrics above can help you maintain optimal performance and swiftly respond to issues.

1. `ApproximateNumberOfMessagesVisible`

Set an alarm for when ApproximateNumberOfMessagesVisible exceeds a threshold, indicating a processing backlog or slow consumers.

2. ApproximateAgeOfOldestMessage

Set an alarm based on the ApproximateAgeOfOldestMessage metric in AWS SQS is crucial for ensuring that messages are being processed on time. A high value for this metric could indicate that there is a bottleneck in your message processing, or that consumers are not keeping up with the producer, potentially leading to delayed data processing or system inefficiency.

3. Message Throughput Alarm

Alarms for significant deviations in NumberOfMessagesSent and NumberOfMessagesReceived can highlight issues in message production or consumption rates.

4. Error Rate Alarm

Monitoring for an increase in failed processes or undeliverable messages is crucial for maintaining system integrity.

Best Practices for SQS Monitoring

Customize Alarm Thresholds: Tailor your alarm thresholds based on historical data and expected application behavior to avoid false positives.
Utilize CloudWatch Dashboards: Create comprehensive dashboards in Amazon CloudWatch to visualize your SQS metrics and alarms in real time, enabling quick assessments and decisions.
Implement Dead Letter Queues: Use SQS Dead Letter Queues (DLQs) to sideline and analyze messages that can’t be processed successfully after several attempts.
Regular Review: Continuously monitor and review your SQS metrics and alarm settings to adjust for changes in application usage patterns or business requirements.

Effectively monitoring AWS SQS with the right set of metrics and alarms not only ensures the smooth operation of your message queuing system but also helps in identifying and mitigating issues before they impact your application’s performance. Embrace these practices to keep your AWS SQS queues healthy, efficient, and aligned with your operational goals.

That’s all about “Monitoring AWS SQS”. Send us your feedback using the message button below. Your feedback helps us create better content for you and others. Thanks for reading!

If you like the article please click the 👏🏻 button below a few times. To show your support!

Follow us on Twitter and LinkedIn

In Plain English 🚀

Thank you for being a part of the In Plain English community! Before you go:

Be sure to clap and follow the writer ️👏️️
Follow us: X | LinkedIn | YouTube | Discord | Newsletter
Visit our other platforms: Stackademic | CoFeed | Venture | Cubed
More content at PlainEnglish.io