Using a Database as a Message Queue - Coding

Using a database as a message queue might seem convenient and cost-effective initially, but it introduces significant drawbacks. This approach can lead to performance bottlenecks, increased complexity, and scalability issues. Traditional message queuing systems are specifically designed to handle high throughput and concurrency, whereas databases are not optimized for such tasks. Relying on a database for message queuing can ultimately result in inefficiencies and potential system failures.

Important Topics for Using a Database as a Message Queue

what are Message Queues?
Differences Between Message Queues and Databases
Why Using a Database as a Message Queue is a bad choice?
Real-world scenarios of Failures
Comparison between dedicated message queue systems and DB as a message queue

What are Message Queues?

Message queues are means by which messages are passed between distributed systems or between the components, or the applications in the composite systems. They cause interaction to be asynchronous and de-coupled as the incoming messages are stored in the queue until the receiver is ready for them. This mechanism improves dependability and extendibility since messages can be queued and subsequently delivered if the receiving system is offline.

Differences Between Message Queues and Databases

Below are the differences between the Message Queues and Databases:

Parameters	Message Queues	Databases
Primary Purpose	Facilitates asynchronous communication and task management	Stores retrieves, and manages structured data persistently
Data Persistence	Messages are typically transient but can be persistent	Data is persistently stored and designed for long-term retention
Access Pattern	FIFO (First In, First Out) or LIFO (Last In, First Out)	Random access via queries using SQL or other query languages
Latency	Low-latency, suitable for real-time processing	Latency depends on query complexity and database size
Scalability	Horizontally scalable by adding more consumers/producers	Can be horizontally and vertically scalable with more complex configurations
Transaction Support	Basic transaction support (e.g., message acknowledgment)	Advanced transaction support with ACID properties
Data Model	Simple, message-oriented data model	Complex, relational, or NoSQL data models
Concurrency Handling	Manages message ordering and delivery guarantees	Manages concurrent access through locks, isolation levels
Data Volume	Handles high volumes of small messages efficiently	Handles large volumes of structured data

Why Using a Database as a Message Queue is a bad choice?

Using a database as a message queue might seem like a convenient solution, but it brings several significant drawbacks:

1. Performance Issues

Latency: Databases are optimized for data storage and retrieval, not for high-throughput, low-latency message processing. This can result in increased latency for message delivery.
Overhead: Databases introduce overhead due to their complex transaction management, indexing, and logging mechanisms, which can slow down message processing.

2. Scalability Challenges

Limited Throughput: Databases are not designed to handle the high throughput required by messaging systems, leading to bottlenecks as the volume of messages increases.
Resource Contention: As the number of messages grows, database resources (CPU, memory, disk I/O) become strained, impacting both the message queue and other database operations.

3. Reliability and Fault Tolerance Concerns

Single Point of Failure: Using a single database instance for message queuing creates a single point of failure, risking message loss or corruption during database outages.
Limited Fault Tolerance: Traditional databases lack the robust fault tolerance and redundancy mechanisms that specialized message queuing systems offer.

4. Concurrency and Transaction Management

Lock Contention: High levels of concurrent access to the database can lead to lock contention, causing delays and reducing throughput.
Complex Transactions: Managing message visibility and ensuring exactly-once delivery can complicate transaction management, often requiring custom logic that is error-prone and difficult to maintain.

Real-world scenarios of Failures while using Database as a Message Queue

1. Performance Degradation

A database is employed by an e-commerce platform to store orders and also to have a queue of notifications intended for customers.

Failure:

When the products are on the flash sale, it means that many orders and notifications reaching the users. The database receives the order processing and message queuing tasks which heavily impacts the system performance. This means that the order processing takes a long time, notification of the customers is also delayed, and this in turn hampers user experience. Customers could be receiving notifications after hours, some orders could not even be processed on time.

2. Concurrency Problems

A financial service employs the use of a queue database for holding and managing the transactions that require fraud checks.

Failure:

Many anti-fraud checks attempt to scan and analyse transactions at the same time, which leads to the use of databases, locks and contention. This leads to the formation of deadlocks, and timeout issues in transactions thus slowing the identification of fraud. Sometimes, various transactions are not completed on time and result in monetary losses and, possibly, a security threat.

3. Resource Contention

An example of a persistent data structure is a queue that a ride-sharing application uses to store ride requests and driver assignments.

Failure:

Due to the high level of ride requests and assignments pressure is exerted on the database. CPU and memory usage by the database server is spent on maintaining both normal data and messages, and it turns into a task of resource contending. Consequently, the app won’t respond, ride requests do not get processed, and a user cannot book a ride.

4. Complex Maintenance

An application that is related to the healthcare field utilizes a database to hold information that relates to notifying patients about their upcoming appointments.

Failure:

In this case, managing a queue includes quite complex logic that has to prevent further processing of already processed messages and should handle retry properly. What happens in the course of the evaluation of such requirements is that the custom logic’s maintenance becomes time-consuming and prone to errors. There is a small glitch in the cleanup script that over a period, causes unprocessed messages to accumulate and fill the database and lead to the crashing of the system and missed appointments.

Comparison between dedicated message queue systems and Database as a message queue

Below is a comprehensive comparison between dedicated message queue systems and database as a message queue:

Parameters	Dedicated Message Queue Systems	Database as a Message Queue
Primary Purpose	Asynchronous message passing, decoupled communication	Primary data storage, not optimized for message passing
Performance and Latency	Low-latency, high-throughput message processing	Higher latency due to transactional overhead and indexing
Scalability	Horizontally scalable by adding more nodes or brokers	Scaling is complex and often limited by database architecture
Concurrency Handling	Built-in mechanisms for message ordering and delivery guarantees	Relies on database locks, leading to potential contention
Transaction Support	Basic transaction support (e.g., message acknowledgment)	Full ACID compliance for data transactions
Message Retention	Messages can be retained based on configuration settings	Messages might be deleted after processing, requiring custom logic for retention
Reliability	Built-in features for retries, dead-letter queues, and fault tolerance	Reliability features are not designed for transient messages
Resource Utilization	Optimized for handling message queues efficiently	Increased resource contention due to mixed workload
Failure Handling	Designed to handle message delivery failures gracefully	Lack of specialized mechanisms for transient message failures

Reffered: https://www.geeksforgeeks.org

System Design

Related
Design a Live Streaming App like Twitch - System Design
Consistency vs. Availability in System Design
Coding Standards and Best Practices for System Design
What is the Relationship Between Kubernetes and Google Kubernetes Engine?
Google Autopilot - Autoscaling

Type:	Geek
Category:	Coding
Sub Category:	Tutorial
Uploaded by:	Admin
Views:	17