Sentence Transformer encodings #1168
Hi,
We fine-tuned Sentence Transformers on our domain-specific data (similar to NLI data). It gives high cosine scores for irrelevant suggestions. We used the labels good, bad, and ok when annotating the data.

Comments
Adding a question to your issue would be quite helpful.
Yes. After extracting embeddings from SBERT, we use the cosine score to sort the results. The issue is that results with a high cosine score are irrelevant, while similar results get lower scores. We are unable to figure out why this is happening.
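For reference, a minimal sketch of the kind of cosine-score ranking described above, using `sentence_transformers.util.cos_sim`. The model name and example texts are placeholders for illustration, not the poster's actual setup:

```python
from sentence_transformers import SentenceTransformer, util

# Placeholder model name; the fine-tuned model would be loaded here instead.
model = SentenceTransformer("paraphrase-MiniLM-L6-v2")

query = "how do I reset my password"
candidates = [
    "steps to change your account password",
    "pricing for the enterprise plan",
]

# Encode query and candidates, then rank candidates by cosine similarity.
query_emb = model.encode(query, convert_to_tensor=True)
cand_embs = model.encode(candidates, convert_to_tensor=True)
scores = util.cos_sim(query_emb, cand_embs)[0]  # shape: (len(candidates),)
ranked = sorted(zip(candidates, scores.tolist()), key=lambda x: x[1], reverse=True)
print(ranked)
```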
Likely due to training the model incorrectly.
Thank you. Does performance depend on the batch size? Could you please elaborate on what you mean by training incorrectly: the epochs, the batch size, or the data? We trained for 4 epochs with batch size 16 and used SoftmaxLoss.
SoftmaxLoss is the wrong loss. Have a look at the other loss functions.
Thanks for your reply. Could you please suggest a preferable loss for training SBERT?
MultipleNegativesRankingLoss or one of the triplet losses
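A minimal training sketch with MultipleNegativesRankingLoss using the library's fit API; the base model name and the (anchor, positive) pairs below are assumptions for illustration, not the poster's data:

```python
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, InputExample, losses

# Assumed base model for illustration.
model = SentenceTransformer("distilbert-base-nli-mean-tokens")

# MultipleNegativesRankingLoss expects (anchor, positive) pairs; the other
# positives in the same batch serve as in-batch negatives for each anchor.
train_examples = [
    InputExample(texts=["how do I reset my password", "steps to change your account password"]),
    InputExample(texts=["cancel my subscription", "how to end a paid plan"]),
    # ... more pairs
]

train_dataloader = DataLoader(train_examples, shuffle=True, batch_size=16)
train_loss = losses.MultipleNegativesRankingLoss(model)

model.fit(
    train_objectives=[(train_dataloader, train_loss)],
    epochs=1,
    warmup_steps=100,
)
```

A practical note from the loss's documentation: larger batch sizes usually help here, since every additional in-batch example is an extra negative.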
Thank you @nreimers. Could you please explain why SoftmaxLoss is the wrong loss? On the SBERT website you mention that you used SoftmaxLoss to train SBERT on NLI data, and our data labels are similar to NLI data.
That it works on NLI is rather a coincidence; there is no good logic behind it.
Thank you for your suggestion. For MultipleNegativesRankingLoss with hard negatives, the team stated: "You can also provide one or multiple hard negatives per anchor-positive pair by structuring the data like this: (a_1, p_1, n_1), (a_2, p_2, n_2)". Could you please elaborate on this statement? Does it mean the loss uses p_j and n_j as negatives for a_i?
Yes
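As a sketch of that data layout (the texts below are made up for illustration): each InputExample holds an (anchor, positive, hard negative) triple, and within a batch the positives and hard negatives of the other triples also act as negatives for a_i.

```python
from sentence_transformers import InputExample

# Hypothetical (anchor, positive, hard_negative) triples.
triples = [
    ("how do I reset my password",
     "steps to change your account password",
     "how do I reset my router"),
    ("cancel my subscription",
     "how to end a paid plan",
     "how to renew a paid plan"),
]

# With MultipleNegativesRankingLoss, example i uses n_i as an explicit hard
# negative, and every p_j and n_j with j != i in the batch as in-batch negatives.
train_examples = [InputExample(texts=[a, p, n]) for a, p, n in triples]
```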
We did synonym expansion on the data, so in our case most a_i and p_j are actually positives of each other. How does the loss work in this case? Won't it affect the embeddings?
Then you have to create a custom DataLoader that ensures that a batch does not contain two entries of the same type
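One way to approximate such a DataLoader is a greedy batcher that never puts two examples from the same group into one batch. The `group_key` callable here is a hypothetical hook you would have to supply, e.g. mapping each example back to the concept its synonyms were expanded from:

```python
import random


def iter_batches_without_same_group(examples, batch_size, group_key):
    """Greedily build batches in which no two examples share group_key(example).

    `group_key` is a hypothetical callable, e.g. returning the concept ID that
    an (anchor, positive) pair was synonym-expanded from.
    """
    pool = list(examples)
    random.shuffle(pool)
    while pool:
        batch, seen_keys, leftover = [], set(), []
        for example in pool:
            key = group_key(example)
            if len(batch) < batch_size and key not in seen_keys:
                batch.append(example)
                seen_keys.add(key)
            else:
                leftover.append(example)
        yield batch
        pool = leftover
```

The library also ships a NoDuplicatesDataLoader, but it deduplicates by exact text rather than by a semantic group, so something like the custom hook above would still be needed for the synonym-expansion case.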
It is very difficult for us to separate out entries of the same type. Is it okay to go with a triplet loss instead?
Sure
Thank you. Does the distance_metric in the triplet loss have any impact on performance? We tried the default Euclidean distance and performance was not good, so we are now trying cosine.
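For reference, a sketch of switching the triplet loss to the cosine distance metric. The model name, texts, and margin are illustrative, and the margin usually needs re-tuning when the metric changes, since the library's default margin is sized for Euclidean distance:

```python
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, InputExample, losses

model = SentenceTransformer("distilbert-base-nli-mean-tokens")  # assumed base model

# (anchor, positive, negative) triples; texts are placeholders.
train_examples = [
    InputExample(texts=["how do I reset my password",
                        "steps to change your account password",
                        "how do I reset my router"]),
]
train_dataloader = DataLoader(train_examples, shuffle=True, batch_size=16)

# Default distance_metric is EUCLIDEAN; COSINE bounds distances to [0, 2],
# so the margin should be much smaller than the Euclidean default.
train_loss = losses.TripletLoss(
    model=model,
    distance_metric=losses.TripletDistanceMetric.COSINE,
    triplet_margin=0.5,  # illustrative value, tune on a dev set
)

model.fit(train_objectives=[(train_dataloader, train_loss)], epochs=1, warmup_steps=100)
```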