[Feature request] Allow sending messages with (nearly) unlimited size #492

sudden6 · 2017-03-03T23:21:58Z

Currently every Tox client must split long messages into smaller parts. I think it would be beneficial to the user experience and the developer experience if toxcore supported messages with unlimited size or with sizes that are way longer than the current maximum (1372 bytes).

I think this can be implemented either by introducing a flag that indicates a message as part of a multipart message.

yurivict · 2017-03-04T03:08:22Z

I almost implemented this in qTox.
This requires persistent storage that toxcore doesn't have.

aaannndddyyy · 2017-03-04T08:27:18Z

+1

sudden6 · 2017-03-04T09:04:58Z

@yurivict I'm interested in how you implemented this in qTox, anything public yet?

Also how do you keep compatibility to other clients?

I also think it can be done without persistent storage, my simple approach would be to only mark a message as confirmed delivered when all parts have arrived, but to retransmit parts as long as both clients are online.

yurivict · 2017-03-05T00:47:02Z

@sudden6 If the messages aren't too large, it is ok to defragment without persistent storage. But imagine that somebody will send 1GB message? He may need to reboot the computer in between, and previous parts will get lost and he will need to start all over again. It is better to have persistent storage IMO in the general case.

I will release qTox with defragmenter once I will work out few remaining problems.

I also think that it is better to have a mid-level library, xtox, that will contain defragmenter itself to allow other clients to use it. This assumes that all clients use sqlite. I am not sure if this is a reasonable assumption.

This xtox library could export another version of Tox interface that will allow large messages.
Bug #499 is a pre-requisite for such library.

isotoxin · 2017-03-09T16:49:14Z

Isotoxin has long able to send messages of unlimited size.

I almost implemented this in qTox.

It will be good to make it compatible together, but I understand it is impossible.

yurivict · 2017-03-09T18:17:51Z

I am going to release the library libtox-defragmenter.so that will do message defragmentation. I am polishing the details now.

This shouldn't be done in the clients, because then defragmentation isn't going to be compatible between clients.

yurivict · 2017-03-11T18:10:01Z

Tox-defragmenter library: https://github.com/yurivict/tox-defragmenter

It needs:

Structured API in the tox library: [overridable API] Added Tox API in a structure form. #502
(for qTox) Wrapped API in the client: Wrapped Tox API in macros. qTox/qTox#4238
The fix in toxcore: Fixed the bug when receipts for messages sent from the receipt callback never arrived. #500

qTox compatible with defragmenter is in my branch: https://github.com/yurivict/qTox/tree/tox-defragmenter

I will likely update tox-defragmenter based on further testing.

TODO:
Make client aware of the persistent nature of receipts returned by tox-defragmenter. Table faux_offline_pending will need the field persistent_receipt which will make qTox show that the message isn't yet sent, but will prevent it from resending it.

sudden6 · 2017-03-12T12:37:16Z

I don't think tox-defragmenter should come with it's own db, most clients already have a db or something else to store persistent data.

In your use case it fits perfectly for qTox, but what about more lightweight clients that don't use sqlcipher?

@isotoxin can you describe the approach you used?

yurivict · 2017-03-12T16:33:15Z

I don't think tox-defragmenter should come with it's own db, most clients already have a db or something else to store persistent data.

It doesn't come with db. It uses the db supplied by the client.

In your use case it fits perfectly for qTox, but what about more lightweight clients that don't use sqlcipher?

The persistent storage is needed to handle large messages. It can be made to not explicitly depend on sqlcipher, only require the functions, so that both sqlcipher and SQLite will fit.

sudden6 · 2017-03-12T16:45:57Z

AFAICT your code directly depends on sqlcipher or SQLite, but what if a client wants to use (encrypted) text files as storage?

I didn't look to deep into your code, but what happens if a "normal" client receives a message with your splitting protocol?

yurivict · 2017-03-13T12:54:26Z

AFAICT your code directly depends on sqlcipher or SQLite, but what if a client wants to use (encrypted) text files as storage?

I added the ability to use the in-memory database: tox_defragmenter_initialize_db_inmemory().

I didn't look to deep into your code, but what happens if a "normal" client receives a message with your splitting protocol?

They will still show it as split, as it happens today.

yurivict · 2017-03-16T19:56:15Z

@sudden6

Here's what I did:

Added the ability to use internal data structures for clients that don't use sqlite or sqlcipher.
Added the test suite to https://github.com/yurivict/tox-defragmenter that currently runs stress tests. Will add more tests later.

Now any client is able to use defragmenter.

Do you think the dependency pull requests for toxcore and qTox can be merged?
Once they are merged, I can create the pull request for qTox with the option to enable long messages.

I would like to add the file attachment feature in qTox, like in e-mail. It will be able to send file attachments in MIME format. But I can't proceed with this without the ability to send large messages.

GrayHatter · 2017-03-17T01:49:36Z

@yurivict You want message splitting in toxcore so you can send files?

yurivict · 2017-03-17T02:27:06Z

I would like to send large messages to support various other features, like attachments.
IMO, it's better to keep toxcore simple. Toxcore lacks persistent storage, without which it is hard to send large messages over slow networks.

GrayHatter · 2017-03-17T05:54:27Z

you know toxcore has a file transfer API right?

As well as generic lossless packets?

yurivict · 2017-03-17T06:06:42Z

Lossless packets are limited in size by TOX_MAX_CUSTOM_PACKET_SIZE=1373.
There are benefits that defragmenter has:

It extends API to seamlessly support arbitrarily long packets.
Long packet transfers are persistent over client restarts.

I don't think file transfers can be continued after client restart.

GrayHatter · 2017-03-17T06:08:48Z

uTox has no problem restarting an in-progress file transfer. Through network failures, client restarts, etc...

https://github.com/TokTok/c-toxcore/blob/master/toxcore/tox.h#L1968-L1979

yurivict · 2017-03-17T06:11:12Z

The way I am proposing is client-neutral. It doesn't even require client to do anything special. How can uTox way be extended to all clients? Sending long message over the file transfer requires the special code that does this in each client.

Also, if lossless packets and file transfers were the same, why this issue is even here? Obviously, it is a problem.

The way I am proposing is through the API adapter. It's client-neutral and toxcore-neurtal. No need to add code in toxcore.

GrayHatter · 2017-03-17T06:17:25Z

The issue is here because qTox sucks?

No, not really. But I don't know why qTox can't resume file transfers. And while I agree that tox should do message splitting for the clients. If you're sending SO MUCH TEXT that you need an SQL database to send it. Perhaps via the Tox Message API is the wrong way?

Maybe a file/document?

Sure you COULD send files and such via tox messages, but if your solution to doing so is; interface with a SQL database, instead of the existing file transfer API. I have to wonder WHY?

yurivict · 2017-03-17T06:21:32Z

I am not sending files, I am extending messaging to send e-mails. File sending is a different feature. My approach avoids splitting messages, and avoids adding complexity to clients and toxcore.

Practically speaking, how can long messages be currently implemented? Every client needs to implement them through file transfers/lossless packets. They need to keep track of the state, to restart from the point when interrupted, etc. With my approach they do not need to do anything above of what they do now. So what is your objection?

GrayHatter · 2017-03-17T06:23:00Z

But toxcore is an instant messenger, not an email client. And IMO, shouldn't try to be both. Do one thing, and do it really well.

yurivict · 2017-03-17T06:24:33Z

Why can't it be both? It is convenient to send files through the same path as short messages are sent.

GrayHatter · 2017-03-17T06:25:35Z

Why can't cars make coffee? The use case for an instant messenger is different from email.

yurivict · 2017-03-17T06:26:28Z

Message can be short or long. Why do you want to limit what Tox can do? The more features the better. IM isn't supposed to do video too.

yurivict · 2017-03-17T06:27:46Z

The system becomes great when it has ecosystem, and is feature-rich. It's not good to limit features of Tox, IMO.

optimumtact · 2017-03-17T06:29:38Z

Tox's stated goal is to be able to replace skype in users lives, anything outside that area would probably be best suited for a fork that can better cater to a protocol like meail

yurivict · 2017-03-17T06:31:31Z

Asking for forks is a death of the project. Which fork users should use then, or devs contribute to? There should be one strong system. Why can't we innovate above of what skype can do? Is skype a golden standard? I am for innovation.

yurivict · 2017-03-17T06:32:51Z

Many people are crazy about the Slack IM. Slack supports attachments, directory and conversation sharing, etc. Why we should model skype, and not slack?

Chuongv · 2017-03-17T06:36:27Z

That analogy is poor. A more correct analogy is that you are suggesting to convert a already made product (Skype) into Slack which is entirely different.

yurivict · 2017-03-17T06:39:03Z

into Slack which is entirely different.

The features aren't mutually exclusive.

Why I can send a 3 sentence message, and can't send a 5 page article? It doesn't make sense to me. Larger size messaging is a natural expectation. Who decides what is IM and what isn't? Is there a definition anywhere?

I asked friends, they said they didn't realize the message size is limited. They asked "Why is the size limited?" The intuitive expectation is that sizes aren't limited.

optimumtact · 2017-03-17T06:44:45Z

forks are not the death of a project? they're a healthy part of the open source ecosystem.

Diadlo · 2017-03-17T07:31:19Z

Why is the size limited?

@yurivict Yes, size of the message is limited. But long message can be splitted on few parts and sended separately. AFAIK, it's how most clients do it now.
ADDED: And I don't understand, what's wrong with this solution?

yurivict · 2017-03-17T07:38:16Z

es, size of the message is limited. But long message can be splitted on few parts and sended separately. AFAIK, it's how most clients do it now

@Diadlo I know. That's the problem I am trying to solve (actually, I did solve it). But people here seem to think that limited message size is a feature, and not a problem, and shouldn't be solved.

Diadlo · 2017-03-17T07:40:52Z

@yurivict What the problem with auto-split?

yurivict · 2017-03-17T07:46:18Z

One problem is that when the larger article gets split, it can't be easily coped-pasted in the same form. Timestamps get copied with it (in qTox). Timestamps shouldn't be added in the middle of the message.

Another problem is that such split breaks paragraphs.

Anther problem is that such split limits many further features. For example, I want to have file attachments. They aren't possible with split.

Another problem is that splitting isn't a natural or intuitive behavior. It's a technical glitch. I think the limit is in part due to UDP.

yurivict · 2017-03-17T07:55:58Z

Skype has the limit of 800 characters for the first message, and 8000 characters for the subsequent messages.

In this recent thread skype users ask skype to eliminate this message limit: https://www.skypefeedback.com/forums/299913-generally-available/suggestions/13124298--message-is-to-long-to-send

Diadlo · 2017-03-17T07:58:36Z

Just small IMO: It's bad idea send message over 8k characters through the chat

sudden6 · 2017-03-17T08:03:01Z

I agree with @yurivict that the limit for one message is too small. I don't think the limit should be several GB, but ~10K would be nice to not have sudden breaks in long messages.

For the emali idea, just send it over existing filetransfer.

yurivict · 2017-03-17T08:11:10Z

I like when several files are attached to the message, not sent separately, one by one. I would gladly use e-mail, but everybody knows that e-mail is insecure. I don't understand why does the limit exist.

GrayHatter · 2017-03-17T08:56:37Z

@yurivict, so a msg can fit within a single UDP packet

…

On Fri, Mar 17, 2017 at 1:11 AM, yurivict ***@***.***> wrote: I like when several files are *attached* to the message, not just sent in an assorted way one by one. I would gladly use e-mail, but everybody knows that e-mail is insecure. I don't understand why does the limit exist. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#492 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAO20LYBvGc7RZn6D9xBBIlnK63f6-iCks5rmkAggaJpZM4MS3Da> .

sudden6 · 2017-03-18T16:58:16Z

IMO this problem could be better solved by extending Messenger with a new packet type, e.g. MESSAGE_MULTI.

This new packet type would then contain a header with fragment number and eventually some other metadata.

The receiver will acknowledge the arrival of every fragment of the message and if all parts arrived acknowledge the whole message. If one of the clients goes offline during the transfer, it's the responsibility of the sender to re-transmit the message when both parts are online again. Fragments are only stored in memory.

The tox_friend_send_message function would decide based on the message length if a multi part message is needed.

PROS

no api change, the current API has no size limit, clients using TOX_MAX_MESSAGE_LENGTH correctly will just start to send long messages
no persistent storage needed
no addon libs needed

CONS

Probably not good for very large (several MB) messages, because every fragment needs to get re-transmitted on connection drops
@zetok brought up the possibility of an Out of Memory DoS attack

CHALLENGES

Fallback for clients that don't yet have this feature? Implement client capability packet #513 would also help
Ensure integrity of the whole message -> probably a good idea to include a hash over the whole message

I think such a feature will be very a huge win for Tox, with not much cost to implement on the existing infrastructure.

yurivict · 2017-03-18T17:24:04Z

@sudden6

no api change, the current API has no size limit

But the limit is returned by tox_max_message_length() (same as TOX_MAX_MESSAGE_LENGTH).

CONS
Probably not good for very large (several MB) messages

^ yes. I don't understand your resistance to the persistent storage. Without it larger messages will suffer progressively. There is no way around it without the storage.

IMO this problem could be better solved by extending Messenger with a new packet type, e.g. MESSAGE_MULTI.

Why have two types of messages based on size? Doesn't this unnecessarily complicate the API?

sudden6 · 2017-03-18T17:47:30Z

But the limit is returned by tox_max_message_length() (same as TOX_MAX_MESSAGE_LENGTH).

yeah, that's why the API isn't changed, on new toxcore versions this would just return a higher value.

I don't understand your resistance to the persistent storage. Without it larger messages will suffer progressively. There is no way around it without the storage.

For very large messages there's already a better solution IMO, namely file transfers. The goal of my suggestion is to raise the limit of how big a text message can be, not to transfer files as or huge amounts of data.

Why have two types of messages based on size? Doesn't this unnecessarily complicate the API?

The packet type is only toxcore internal, clients will not know how the message is sent and only get a pointer to the message + it's size like it is now.

isotoxin · 2017-03-18T18:50:29Z

IMO this problem could be better solved by extending Messenger with a new packet type, e.g. MESSAGE_MULTI

Agree.
But my suggestion is just set limit of message length to 64kbytes.
That it. No one needs a message size greater than 64kb. Why more?
Btw, Isotoxin's message input control is limited to 8kb. Even this is enough for everyone.

sudden6 · 2017-03-18T20:57:26Z

@isotoxin yes, I agree that a limit is needed, 64KB seems reasonable. Does isotoxin also use another packet type for large messages?

yurivict · 2017-03-18T22:15:21Z

No one needs a message size greater than 64kb. Why more?
... 8kb. Even this is enough for everyone.

You are factually wrong. I need messages more than 8kB.
And in this thread hundreds of users said that they need longer messages too: https://www.skypefeedback.com/forums/299913-generally-available/suggestions/13124298--message-is-to-long-to-send

Diadlo · 2017-03-18T22:33:58Z

@yurivict Again: tox doesn't block long message. You will never have a error like: "message to long"
But really looong message open way to "Out of Memory DoS attack"

isotoxin · 2017-03-19T09:57:01Z

Does isotoxin also use another packet type for large messages?

It uses same message packet id, but it appends special marker ("\1\1") at end of message, if it's size is bigger then current limit. Even target client does not support "message chain" capability, it receive correct chain of messages.
If target client support "message chain" capability, and if it detected special marker at end of message, it just wait for 5 seconds and append next messages to marked one.

sudden6 added suggestion Suggestions messenger Messenger labels Mar 3, 2017

This was referenced Mar 11, 2017

[overridable API] Added Tox API in a structure form. #502

Closed

Wrapped Tox API in macros. qTox/qTox#4238

Closed

iphydf added this to the v0.3.0 milestone Jan 16, 2018

iphydf added the P3 Low priority label Apr 27, 2020

[Feature request] Allow sending messages with (nearly) unlimited size #492

[Feature request] Allow sending messages with (nearly) unlimited size #492

Comments

sudden6 commented Mar 3, 2017

yurivict commented Mar 4, 2017 • edited Loading

aaannndddyyy commented Mar 4, 2017

sudden6 commented Mar 4, 2017

yurivict commented Mar 5, 2017

isotoxin commented Mar 9, 2017

yurivict commented Mar 9, 2017

yurivict commented Mar 11, 2017 • edited Loading

sudden6 commented Mar 12, 2017

yurivict commented Mar 12, 2017 • edited Loading

sudden6 commented Mar 12, 2017

yurivict commented Mar 13, 2017

yurivict commented Mar 16, 2017

GrayHatter commented Mar 17, 2017

yurivict commented Mar 17, 2017 • edited Loading

GrayHatter commented Mar 17, 2017

yurivict commented Mar 17, 2017

GrayHatter commented Mar 17, 2017

yurivict commented Mar 17, 2017 • edited Loading

GrayHatter commented Mar 17, 2017

yurivict commented Mar 17, 2017 • edited Loading

GrayHatter commented Mar 17, 2017

yurivict commented Mar 17, 2017 • edited Loading

GrayHatter commented Mar 17, 2017

yurivict commented Mar 17, 2017

yurivict commented Mar 17, 2017

optimumtact commented Mar 17, 2017

yurivict commented Mar 17, 2017

yurivict commented Mar 17, 2017 • edited Loading

Chuongv commented Mar 17, 2017

yurivict commented Mar 17, 2017 • edited Loading

optimumtact commented Mar 17, 2017

Diadlo commented Mar 17, 2017 • edited Loading

yurivict commented Mar 17, 2017

Diadlo commented Mar 17, 2017

yurivict commented Mar 17, 2017 • edited Loading

yurivict commented Mar 17, 2017

Diadlo commented Mar 17, 2017

sudden6 commented Mar 17, 2017

yurivict commented Mar 17, 2017 • edited Loading

GrayHatter commented Mar 17, 2017 via email

sudden6 commented Mar 18, 2017

yurivict commented Mar 18, 2017 • edited Loading

sudden6 commented Mar 18, 2017

isotoxin commented Mar 18, 2017

sudden6 commented Mar 18, 2017

yurivict commented Mar 18, 2017

Diadlo commented Mar 18, 2017 • edited Loading

isotoxin commented Mar 19, 2017

yurivict commented Mar 4, 2017 •

edited

Loading

yurivict commented Mar 11, 2017 •

edited

Loading

yurivict commented Mar 12, 2017 •

edited

Loading

yurivict commented Mar 17, 2017 •

edited

Loading

yurivict commented Mar 17, 2017 •

edited

Loading

yurivict commented Mar 17, 2017 •

edited

Loading

yurivict commented Mar 17, 2017 •

edited

Loading

yurivict commented Mar 17, 2017 •

edited

Loading

yurivict commented Mar 17, 2017 •

edited

Loading

Diadlo commented Mar 17, 2017 •

edited

Loading

yurivict commented Mar 17, 2017 •

edited

Loading

yurivict commented Mar 17, 2017 •

edited

Loading

yurivict commented Mar 18, 2017 •

edited

Loading

Diadlo commented Mar 18, 2017 •

edited

Loading