Should message fan in be avoided? If so, how?

bshurn · February 16, 2023, 12:16am

Hi all,

I am designing a solution to manage MQTT devices & have a question about message fan in – where a large number of clients (100K+) publish messages to a single topic tree.

Some background…
In order for the solution to manage devices, it needs to know some information about each and every device. At startup each client sends a message which includes the information the solution needs for device management.

The question is… which topic should each client publish to?

Option 1: Single topic tree
Clients publish to: clientinformation/[client-id]

Concern: Message fan-in.
In this approach multiple clients are publishing to one solution side client subscribed to the wildcard topic ‘clientinformation/+’.

Having one client responsible for processing these messages raises concerns around latency and rate limiting. This also seems to give us no ability to scale horizontally.

Are these concerns valid? Are there well known solutions to this type of problem?

Option 2: Partitioned topics
Assuming the concerns from Option #1 are valid, I am considering the alternative where each client is assigned a partition as part of the bootstrapping process (when they discover the broker endpoint).
This partition information would be included in the topic tree to allow for concurrent processing.

Topic template: clientinformation/[partition]/[client-id]
For example:
ClientA publishes to clientinformation/1/clienta
ClientB publishes to clientinformation/2/clientb
ClientC publishes to clientinformation/N/clientc

The solution would then deploy N clients, each subscribed to a single partition:
Solution1 subscribes to clientinformation/1/+
Solution2 subscribes to clientinformation/2/+
SolutionN subscribes to clientinformation/N/+

Concerns here are:

Is this a problem we need to solve, or is this strictly a theoretical concern
Are there well established solutions to this problem?
How we’ll reliably deliver this partition information to devices.
How or if we’ll need to repartition.

Daria_H · February 16, 2023, 2:58pm

Hi @bshurn ,

Thank you for your interest in MQTT, welcome to the community!

You can use Shared Subscription to balance the load of a single subscription across multiple MQTT clients.

For Enterprise level solutions it is possible to stream messages from a HiveMQ broker to Kafka.

I hope it helps.
Kind regards,
Dasha from HiveMQ team

Topic		Replies	Views
Best practice for receiving messages HiveMQ Client Library	3	1881	January 4, 2022
Does HiveMQ support multiple tenants? Identical devices should use the same topics and message should only be visible to corresponding users	3	1129	September 8, 2022
Send MQTT message to specific client?	4	1567	November 17, 2022
Topic Design Best Practices MQTT	0	670	July 1, 2021
How can I build my topics if I have multiple machines and users? HiveMQ Community Edition	5	627	May 29, 2022

Should message fan in be avoided? If so, how?

Related topics