- Notifications
You must be signed in to change notification settings - Fork4k
[Questions] create quorum queue failed#15099
Uh oh!
There was an error while loading.Please reload this page.
Uh oh!
There was an error while loading.Please reload this page.
-
Community Support Policy
RabbitMQ version used4.1.2 Erlang version used26.2.x Operating system (distribution) usedlinux How is RabbitMQ deployed?Generic binary package rabbitmq-diagnostics status outputSeehttps://www.rabbitmq.com/docs/cli to learn how to use rabbitmq-diagnostics DetailsLogs from node 1 (with sensitive values edited out)Seehttps://www.rabbitmq.com/docs/logging to learn how to collect logs DetailsLogs from node 2 (if applicable, with sensitive values edited out)Seehttps://www.rabbitmq.com/docs/logging to learn how to collect logs DetailsLogs from node 3 (if applicable, with sensitive values edited out)Seehttps://www.rabbitmq.com/docs/logging to learn how to collect logs Detailsrabbitmq.confSeehttps://www.rabbitmq.com/docs/configure#config-location to learn how to find rabbitmq.conf file location DetailsSteps to deploy RabbitMQ clusterconfig cluster and start Steps to reproduce the behavior in questionso many quorum queue create advanced.configSeehttps://www.rabbitmq.com/docs/configure#config-location to learn how to find advanced.config file location DetailsApplication codeDetails# PASTE CODE HERE, BETWEEN BACKTICKSKubernetes deployment fileDetails# Relevant parts of K8S deployment that demonstrate how RabbitMQ is deployed# PASTE YAML HERE, BETWEEN BACKTICKS What problem are you trying to solve?When my client restarts in batches (rolling upgrade), a large number of queue(1w+ quorum queues and 5w+ exclusive queues) registration requests are generated, and I occasionally receive some error messages. At the same time, we periodically call the API: /api/health/checks/port-listener/5673 to check whether the server status is normal. During the client start, this API also frequently reports errors. It seems that some I/O timeout errors occurred when creating the queue, and then registering consumers generated a large number of noproc exceptions. |
BetaWas this translation helpful?Give feedback.
All reactions
@dormanze inject a random delay in the 1-15s range to your clients so that they do not all run their declarations at once.
There were several efficiency improvements around Khepri in the upcoming4.2.2 release.
Replies: 3 comments 7 replies
-
Around the time you restarted there will most likely be a stack trace with the full error and reason for why some processes did not start. Those are the logs I need to investigate further. |
BetaWas this translation helpful?Give feedback.
All reactions
-
Could you tell me how to enable logging for this? I noticed that only my temporary exclusive queue is experiencing timeouts; the quorum queue does not. |
BetaWas this translation helpful?Give feedback.
All reactions
-
Errors are always logged by default, you just need to supply all logs from all nodes and we can take a look. |
BetaWas this translation helpful?Give feedback.
All reactions
-
@dormanze any update in providing logs covering the timeframe of the restart? |
BetaWas this translation helpful?Give feedback.
All reactions
-
Thank you for your reply. I am having some difficulty uploading the logs. I have carefully checked the startup logs and found no obvious errors. I will try version 4.2.x later to see if it can resolve my issue. |
BetaWas this translation helpful?Give feedback.
All reactions
-
@dormanze inject a random delay in the 1-15s range to your clients so that they do not all run their declarations at once. There were several efficiency improvements around Khepri in the upcoming |
BetaWas this translation helpful?Give feedback.
All reactions
-
Thank you for your response. Unfortunately, we cannot do that because we have a large number of queues, and we need to ensure the upgrade duration when performing rolling upgrades on the client. |
BetaWas this translation helpful?Give feedback.
All reactions
-
I noticed in my observation logs that not only does creating queues sometimes time out, but adding bindings also experiences timeouts. |
BetaWas this translation helpful?Give feedback.
All reactions
-
All operations on a queue without an elected leader (because it timed out when enough of them are triggered at once) will fail, this is how Raft works. |
BetaWas this translation helpful?Give feedback.
All reactions
-
Yes, it seems that all of this is related to my switching to khepri_db. I will roll back that capability first. Thank you for your suggestion. |
BetaWas this translation helpful?Give feedback.

