-
PreambleThere was quite a large load for the OM Leader through writing keys by S3G MultipartUpload operations: 14 100Gb files were written to the cluster from 14 clients/hosts ( Performance problemsDuring the writing operations, OM Leader's performance was awkward: Does anybody know how to try to work the issue around?
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 2 replies
-
There are some existing tickets to optimize the space for Multipart Upload including things like reducing the message size: As I understand it, the main issue is that currently As you suggested, removing unnecessary fields might also help to reduce the serde cost. As seen from the profile As for the multi-raft solution, this probably requires more engineering effort since this means sharding the OMs which requires more deep design decisions. Even if for example we shard by bucket, if there are many concurrent multipart uploads in the bucket, the issue would happen again. @vtutrinov As for now, do we know how large is the multipart uploads part size? Smaller part size might cause increase in the number of MPU part generated which also adds the serde cost, so maybe you can increase the size of the parts for large MPU key. For the long AllocateBlock calls, I'm not really sure since part upload will specify the size and will use CreateKey instead. Unless your Feel free to pick the tickets or file your own if needed. |
Beta Was this translation helpful? Give feedback.
-
@ivandika3, thanks for the quick response! A workaround involving increasing the part size during MPU works well. It seems the fix for the HDDS-8238 jira ticket would be a great way to resolve the problem |
Beta Was this translation helpful? Give feedback.
There are some existing tickets to optimize the space for Multipart Upload including things like reducing the message size:
As I understand it, the main issue is that currently
multipartInfoTable
storesOmMultipartKeyInfo
which contains all the previously committed parts for a particular upload ID. This means that as more parts are committed, the serde cost would increase considerably as we need to load and write the existing parts. As suggested by HDDS-8238, we can change the structure of the multipart info table (we probably need to introduce a new table) to "flatten" the structure such th…