
Classification with x-transformers #264

Open · wants to merge 5 commits into main
Conversation

RyanKim17920 (Author)

Added a CLS token / pooling option for NLP-based full-text classification.

x_transformers/x_transformers.py
x = x[:, 0]

if self.use_pooling:
x = self.pooling(x).squeeze()
lucidrains (Owner)

for the pooling, we need to account for masking (masked averaging)
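
For context, a minimal sketch of what masked averaging means here (an illustration of the idea, not the implementation later added to the repo): padding positions must be excluded from both the sum and the divisor.

import torch

def masked_mean(x, mask):
    # x:    (batch, seq_len, dim) token embeddings
    # mask: (batch, seq_len) bool, True at real (non-padding) tokens
    mask = mask.unsqueeze(-1)                  # (batch, seq_len, 1)
    summed = (x * mask).sum(dim = 1)           # sum over real tokens only
    count = mask.sum(dim = 1).clamp(min = 1)   # token counts, guarding against division by zero
    return summed / count                      # (batch, dim)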

lucidrains (Owner)

i can take care of this if you'd like, it is all around a bit tricky

RyanKim17920 (Author)

Yes, please do so. Thank you 👍

lucidrains (Owner)

@RyanKim17920 do you want to try the latest changes and see if that's enough?

lucidrains (Owner)

@RyanKim17920 hey Ryan, sorry for hijacking your efforts, just that the project is at a size where things need to be a bit more particular

your example should run now as

import torch
from torch import nn

from x_transformers import (
    TransformerWrapper,
    Encoder
)

# CLS token test
transformer = TransformerWrapper(
    num_tokens=6,
    max_seq_len=10,
    logits_dim=2, # num_classes 
    use_cls_token=True,
    attn_layers = Encoder(
        dim = 6,
        depth = 1,
        heads = 2,
    )
)

x = torch.randint(0, 5, (2, 10))
y = torch.tensor([0, 1])

print(x.shape)
logits = transformer(x)
print(logits.shape)
loss = nn.CrossEntropyLoss()(logits, y)

print(loss)

# BCE cls token

transformer = TransformerWrapper(
    num_tokens=6,
    max_seq_len=10,
    logits_dim=1, # num_classes 
    use_cls_token=True,
    squeeze_out_last_dim = True,
    attn_layers = Encoder(
        dim = 6,
        depth = 1,
        heads = 2,
    )
)

x = torch.randint(0, 5, (2, 10))  # token ids must stay integer (long) for the embedding
y = torch.tensor([0, 1]).float()  # BCE targets are float

print(x.shape)
logits = transformer(x)  # squeeze_out_last_dim already yields shape (batch,)
loss = nn.BCEWithLogitsLoss()(logits, y)

print(loss)

# pooling test
transformer = TransformerWrapper(
    num_tokens=6,
    max_seq_len=10,
    logits_dim=2, # num_classes 
    average_pool_embed = True,
    attn_layers = Encoder(
        dim = 6,
        depth = 1,
        heads = 2,
    )
)

x = torch.randint(0, 5, (2, 10))
y = torch.tensor([0, 1])

print(x.shape)
logits = transformer(x)
print(logits.shape)
loss = nn.CrossEntropyLoss()(logits, y)

print(loss)

# pooling BCE test
transformer = TransformerWrapper(
    num_tokens=6,
    max_seq_len=10,
    logits_dim=1, # num_classes 
    average_pool_embed = True,
    squeeze_out_last_dim = True,
    attn_layers = Encoder(
        dim = 6,
        depth = 1,
        heads = 2,
    )
)

x = torch.randint(0, 5, (2, 10))  # token ids must stay integer (long) for the embedding
y = torch.tensor([0, 1]).float()  # BCE targets are float

print(x.shape)
logits = transformer(x)  # squeeze_out_last_dim already yields shape (batch,)
print(logits.shape)
loss = nn.BCEWithLogitsLoss()(logits, y)

print(loss)

# batch size 1 test

transformer = TransformerWrapper(
    num_tokens=6,
    max_seq_len=10,
    logits_dim=2, # num_classes 
    average_pool_embed = True,
    attn_layers = Encoder(
        dim = 6,
        depth = 1,
        heads = 2,
    )
)

x = torch.randint(0, 5, (1, 10))
y = torch.tensor([0])

print(x.shape)
logits = transformer(x)
print(logits.shape)


RyanKim17920 commented Aug 20, 2024

Thank you for the improvements you've already made to my original additions. I noticed that my changes to test/x_transformers are now outdated, so they aren't needed anymore. However, I believe the example I provided could still be valuable: it demonstrates NLP classification on a well-known dataset, which might help users understand how to implement it while reaching validation accuracy in the high 90% range.

Would it be possible to add the example to the repository?
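
For readers, a minimal sketch of what such an end-to-end classification example might look like, assuming a tokenized binary-classification batch and TransformerWrapper's mask keyword for padding; input_ids, mask, and labels are placeholder names, and real dataset loading is elided.

import torch
from torch import nn
from x_transformers import TransformerWrapper, Encoder

model = TransformerWrapper(
    num_tokens = 20000,
    max_seq_len = 512,
    logits_dim = 2,             # num_classes
    average_pool_embed = True,  # masked mean over the sequence
    attn_layers = Encoder(
        dim = 128,
        depth = 4,
        heads = 8
    )
)

# placeholder batch; a real example would tokenize an actual dataset
input_ids = torch.randint(0, 20000, (8, 512))
mask = torch.ones(8, 512).bool()   # True at real tokens, False at padding
labels = torch.randint(0, 2, (8,))

logits = model(input_ids, mask = mask)  # (8, 2)
loss = nn.CrossEntropyLoss()(logits, labels)
loss.backward()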
