
[IDEA] Granular Message Storage Settings #6255

Open
thorst opened this issue Jul 11, 2024 · 5 comments
Labels
enhancement New feature or request

Comments


thorst commented Jul 11, 2024

Is your feature request related to a problem? Please describe.
Our problem is that our database is currently 6 TB. Our devs want 45 days of history and are unwilling to budge on that; in fact, they would like more.

Describe your use case
Having granular settings for what content is stored is imperative for us to be better stewards of our data and disk requirements. In Cloverleaf we stored only the raw inbound from the inbound thread and what was sent in the outbound thread. In Mirth, that would mean saving the raw message from the source connector and the final state on the outbound connectors.

Describe the solution you'd like
I would like the message storage section of the channel configuration's Summary tab to have an Advanced button. Clicking it would let you select which states to save for each connector: each connector would be a row in a table, with a checkbox for each state.

I realize this is much less user friendly than the current solution, so I would keep the current options for regular folks, but power users need more configuration. We don't need all of this data stored; we would rather keep just the raw message, resend it if needed, and store the data for longer. Imagine storing 90 days of messages in less disk space than we currently use for 45.
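To show the shape of what the advanced view would edit, here is a purely hypothetical sketch written as a config object. None of these names exist in Mirth today; it only illustrates the per-connector matrix:

```javascript
// Hypothetical only -- no such setting exists in Mirth today. One row per
// connector, one flag per stored state; anything false is never written to disk.
var storageMatrix = {
    'Source':                     { raw: true,  transformed: false, encoded: false },
    'Destination 1 (EMR feed)':   { sent: true, transformed: false, response: false },
    'Destination 2 (audit copy)': { sent: true, transformed: false, response: false }
};
```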

Add to that that we are now discussing how we will upgrade PostgreSQL. The usual procedure is to export, uninstall Postgres, install the new version of Postgres, and then import. That would take prohibitively long with 6 TB.

Describe alternatives you've considered

  1. Store less - our devs prefer the ability to go back 45 days if possible; they are very adamant about not going lower.
  2. Use the global pre/postprocessor - I am already getting concurrency errors with the postprocessor (a different topic), but we could store the transaction to an external db. This is nice because users don't have to make any changes. We would most likely want an exclusion list for channels where we don't need to save a copy, but ultimately a little more storage than we strictly need is fine; it would still be far less than what is being stored currently. (A sketch follows this list.)
  3. Code template - users could call a code template wherever they want to store the transaction, and it would save to an external db. This is perfect in that it saves exactly when you want it to, but bad because you have to remember to call it.
  4. Archiving - I could potentially use the archiver to save the files out to a directory, which a channel could read from and insert into the db. This seems too clunky, though, and without testing I'm afraid it would try to clean up old files (which no longer exist because another process has read them in) - though I should say I'm not sure what the intended use case for that feature is.
  5. Database scraper - a channel or an external script/process could loop over the database tables and extract the data you want. The positive is that it could live in an external process, so Mirth isn't bogged down with the details. The negative is that it would be slightly delayed, and it could be intensive if it ran over too much at once, say a full previous day's worth of data. (A second sketch below shows this.)
  6. Clustering - would allow you to upgrade one db at a time.
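To make option 2 concrete, here is a rough, untested sketch of a global postprocessor doing it. The `mirth_history` database, the `message_history` table, and the `historyExclusions` globalMap entry are invented for illustration; only the script-scope objects (`message`, `channelId`, `DatabaseConnectionFactory`) are standard Mirth 3.x:

```javascript
// Global postprocessor sketch (untested). Copies the source connector's raw
// content to an external history database, skipping channels on an exclusion
// list. Table name, columns, and the globalMap key are made up for this example.
var exclusions = globalMap.get('historyExclusions'); // e.g. a java.util.HashSet of channel IDs
if (exclusions == null || !exclusions.contains(channelId)) {
    var raw = message.getMergedConnectorMessage().getRawData();
    var dbConn = DatabaseConnectionFactory.createDatabaseConnection(
        'com.mysql.cj.jdbc.Driver',
        'jdbc:mysql://history-host:3306/mirth_history', 'mirth', 'secret');
    try {
        var params = new java.util.ArrayList();
        params.add(channelId);
        params.add(message.getMessageId());
        params.add(raw);
        dbConn.executeUpdate(
            'INSERT INTO message_history (channel_id, message_id, raw_content) VALUES (?, ?, ?)',
            params);
    } finally {
        dbConn.close();
    }
}
```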

All of these solutions assume an external database, which could go down and cause issues for the process, and there could be bugs in my code. I would need to build all of this, which is fine, but an official solution would be preferred.
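For option 5, the relevant detail is that Mirth keeps one set of d_m&lt;n&gt;/d_mm&lt;n&gt;/d_mc&lt;n&gt; tables per channel, keyed by the local channel id in D_CHANNELS, with a content-type code on each content row (raw appears to be 1 and the source connector metadata_id 0 on our 3.x schema; verify against your own install). A rough sketch as a JavaScript Reader source script:

```javascript
// JavaScript Reader sketch (untested): pull the previous day's raw source
// content out of Mirth's own tables. 42 stands in for the channel's local id
// from D_CHANNELS, and the 'History Writer' channel is hypothetical.
var db = DatabaseConnectionFactory.createDatabaseConnection(
    'org.postgresql.Driver',
    'jdbc:postgresql://localhost:5432/mirthdb', 'mirth', 'secret');
try {
    var rows = db.executeCachedQuery(
        "SELECT m.id, c.content FROM d_m42 m " +
        "JOIN d_mc42 c ON c.message_id = m.id AND c.metadata_id = 0 " +
        "WHERE c.content_type = 1 " +
        "AND m.received_date >= NOW() - INTERVAL '1 day'");
    while (rows.next()) {
        // hand each raw message to a channel that writes it to the history db
        router.routeMessage('History Writer', rows.getString('content'));
    }
} finally {
    db.close();
}
```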

Messages would be stored in a cold-storage db, which would make them harder to search and retrieve. It would pull them out of their current workflow, and while I could get snazzy and add a resend button that takes the data and calls the Client API, it would still be harder to resend than with the message history we have today. The obvious answer is for Mirth to make the store-or-not decision at the time of saving, so the disk space is never occupied to begin with; the user then interacts with messages in the message history the same as always.
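On the resend-button idea: the Client API does expose an endpoint for processing a new message through a channel (POST /api/channels/{channelId}/messages), so a cold-storage UI could push archived raw content back through. A rough sketch, where the host, credentials, and auth details are assumptions to verify against your own install:

```javascript
// Sketch (untested) of resending an archived raw message via the Client API.
// Assumes basic auth is accepted; newer 3.x builds also want an
// X-Requested-With header on REST calls. URL and credentials are examples.
function resendRaw(channelGuid, rawMessage) {
    var url = new java.net.URL('https://mirth-host:8443/api/channels/' + channelGuid + '/messages');
    var conn = url.openConnection();
    conn.setRequestMethod('POST');
    conn.setDoOutput(true);
    var auth = java.util.Base64.getEncoder().encodeToString(
        new java.lang.String('admin:admin').getBytes('UTF-8'));
    conn.setRequestProperty('Authorization', 'Basic ' + auth);
    conn.setRequestProperty('X-Requested-With', 'OpenAPI');
    conn.setRequestProperty('Content-Type', 'text/plain');
    var out = conn.getOutputStream();
    out.write(new java.lang.String(rawMessage).getBytes('UTF-8'));
    out.close();
    return conn.getResponseCode(); // 2xx means the channel accepted the message
}
```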


thorst commented Jul 12, 2024

An alternative could be a "first and last" setting that saves the raw message on the source connector and the sent content for each destination. This is much less optimized and configurable, but probably easier for you all to implement.

It wouldn't solve all issues for everyone, but I am sure many would use it.


kirbykn2 commented Jul 15, 2024 via email


thorst commented Jul 15, 2024

Just convenience. I'm not the one setting the requirement, but for systems where you don't have access to resend from the source, the interface engine is the next best thing and faster than interacting with the vendor.


kirbykn2 commented Jul 15, 2024 via email


thorst commented Jul 15, 2024

> You pay for convenience.

For sure. Our disk usage and associated cost is high, and Mirth uses more than our old Cloverleaf system did; that's what this ticket is about.

> I would really question resending from Mirth messages that are that old. How often does it happen? Why does it happen?

Sometimes you can use the history to build knowledge of all the transactions sent for a patient. Other times a downstream consumer will say they didn't get, or misplaced, a message (like a result) and want us to resend. Or they received it, but it didn't file because of some piece of data on the message, at which point we would manually hack the message to get it to file into their system (a very edge case). Of course you wouldn't want to resend data that had been corrected or otherwise superseded, but there are plenty of cases where this is perfectly fine.

> Should you be using database level resources (storing in Mirth for that long)? I would, they do not have to resend messages in Mirth that often, and if they do, I would like to fix the issue that is causing resends.

The current plan is to write the message history to a MySQL db. That way I can upgrade our primary Postgres db, which will be much smaller, and then separately upgrade MySQL. Since MySQL would hold only message history, it could tolerate a longer downtime without impacting patient care. I can also get pickier about what I store, so the size will be much smaller than what Mirth is storing currently. It would be much nicer if we could make these granular tweaks in Mirth itself, store much less, and keep it all in one db; for now, though, I will store to the backup db, and it will be pared down.

I wouldn't describe us as needing to resend often, or say there is a common cause. With a complex system there will always be things that pop up here or there, and we could just say, "oh, no, we can't do that" - but where possible we like to say, "sure, give me 10 minutes". So with my new setup we would set the pruner very short, like 2-7 days, and rely on the archive system in the situations where it's needed. Several versions ago in Cloverleaf everything was file based; that was nice for compression, but whenever you needed to search for something it was a PITA.
