-
I would suggest parsing your large input file directly with staedi and then chunking the output into smaller files, if that is your desired strategy. The parser is designed to minimize memory usage and maintain decent throughput for scenarios like this. Skipping to a particular offset wouldn't work well at the moment: each reader instance is indeed stateful, and an input block (for X12) always needs to begin with the ISA interchange header, since that is where the delimiters are detected.
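For illustration, a minimal sketch of that approach using the streaming API (`EDIInputFactory` / `EDIStreamReader`); the actual chunk-writing is stubbed out, since it depends on your storage layout:

```java
import java.io.BufferedInputStream;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;

import io.xlate.edi.stream.EDIInputFactory;
import io.xlate.edi.stream.EDIStreamReader;

public class SplitLargeX12 {
    public static void main(String[] args) throws Exception {
        EDIInputFactory factory = EDIInputFactory.newFactory();

        // staedi consumes the stream incrementally, so a multi-hundred-MB
        // interchange never needs to be fully resident in memory.
        try (InputStream in = new BufferedInputStream(Files.newInputStream(Path.of(args[0])));
             EDIStreamReader reader = factory.createEDIStreamReader(in)) {

            while (reader.hasNext()) {
                switch (reader.next()) {
                case START_GROUP:
                    // A natural split point: open a new output file here and
                    // replay the ISA envelope so each chunk stands alone.
                    break;
                case START_SEGMENT:
                    // reader.getText() holds the segment tag, e.g. "GS" or "ST".
                    break;
                default:
                    break;
                }
            }
        }
    }
}
```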
-
I have use cases that involve storing and processing X12 files in excess of 300 MB each.
Given this, I was considering a chunking strategy where each file is split into chunks of <= 4 KB. I can create a facade InputStream over those chunks to pass to the reader (sketch below), but I was also hoping not to have to start processing each file from the beginning (offset 0) every time. I'd much prefer to start processing after, say, a Group Header segment, with the logical offset Z mapped to an offset within chunk Y.
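Something like this is what I have in mind for the facade, using only JDK pieces (sketch only; how the ordered chunk paths are listed is up to the storage layer):

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.SequenceInputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Enumeration;
import java.util.Iterator;
import java.util.List;

public final class ChunkFacade {

    // Presents the ordered chunk files as one logical stream, so the EDI
    // reader sees a single contiguous X12 document. Chunks are opened
    // lazily; with 4 KB chunks a 300 MB file has ~75,000 of them, so
    // opening them all eagerly would exhaust file handles.
    public static InputStream open(List<Path> orderedChunks) {
        Iterator<Path> chunks = orderedChunks.iterator();
        return new SequenceInputStream(new Enumeration<InputStream>() {
            @Override
            public boolean hasMoreElements() {
                return chunks.hasNext();
            }

            @Override
            public InputStream nextElement() {
                try {
                    return Files.newInputStream(chunks.next());
                } catch (IOException e) {
                    throw new UncheckedIOException(e);
                }
            }
        });
    }
}
```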
I believe this might be possible if (1) the reader is stateless, or (2) I can fetch state from and inject state into the reader, e.g., the delimiters detected from the initial block. I'd also need to capture the offset of the start of the next segment, so that I could resume processing the InputStream from the next group segment, and so on.
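For (2), staedi does appear to expose some of that state read-only; something like this is what I'd hope to capture (a sketch assuming `getDelimiters()` and `getLocation()` on `EDIStreamReader` — whether any of it can be injected back into a fresh reader is exactly my question):

```java
import java.io.InputStream;
import java.util.Map;

import io.xlate.edi.stream.EDIInputFactory;
import io.xlate.edi.stream.EDIStreamEvent;
import io.xlate.edi.stream.EDIStreamReader;
import io.xlate.edi.stream.Location;

public class CaptureReaderState {
    public static void capture(InputStream in) throws Exception {
        EDIInputFactory factory = EDIInputFactory.newFactory();
        try (EDIStreamReader reader = factory.createEDIStreamReader(in)) {
            while (reader.hasNext()) {
                if (reader.next() == EDIStreamEvent.START_SEGMENT
                        && "GS".equals(reader.getText())) {
                    // Delimiters detected from the ISA segment.
                    Map<String, Character> delims = reader.getDelimiters();

                    // Position of the group header just read; a resume point
                    // would have to be derived from something like this.
                    Location where = reader.getLocation();
                    System.out.printf("GS at segment %d (delims: %s)%n",
                            where.getSegmentPosition(), delims);
                }
            }
        }
    }
}
```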
Is this making any sense to you? Is it possible with staedi? Do you have any alternative suggestions or pointers for handling large EDI inputs?
Thanks!