John Snow Labs Spark-NLP 4.4.0: New BART for Text Translation & Summarization, new ConvNeXT Transformer for Image Classification, new Zero-Shot Text Classification by BERT, more than 4000+ state-of-the-art models, and many more! #13754
maziyarpanahi
announced in
Announcement
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
📢 Overview
We are thrilled to announce the release of Spark NLP 🚀 4.4.0! This release includes new features such as a New BART for NLG, translation, and comprehension; a new ConvNeXT Transformer for Image Classification, a new Zero-Shot Text Classification by BERT, 4000+ new state-of-the-art models, and more enhancements and bug fixes.
We want to thank our community for their valuable feedback, feature requests, and contributions. Our Models Hub now contains over 17,000+ free and truly open-source models & pipelines. 🎉
Spark NLP has a new home! https://sparknlp.org is where you can find all the documentation, models, and demos for Spark NLP. It aims to provide valuable resources to anyone interested in 100% open-source NLP solutions by using Spark NLP 🚀.
🔥 New Features
ConvNeXT Image Classification (By Facebook)
NEW: Introducing ConvNextForImageClassification annotator in Spark NLP 🚀.
ConvNextForImageClassification
can loadConvNeXT
models that compete favorably with Transformers in terms of accuracy and scalability, achieving 87.8% ImageNet top-1 accuracy and outperforming Swin Transformers on COCO detection and ADE20K segmentation, while maintaining the simplicity and efficiency of standard ConvNets.This annotator is compatible with all the models trained/fine-tuned by using
ConvNextForImageClassification
for PyTorch orTFConvNextForImageClassification
for TensorFlow models in HuggingFace 🤗BART for NLG, Translation, and Comprehension (By Facebook)
NEW: Introducing BartTransformer annotator in Spark NLP 🚀.
BartTransformer
can load BART models fine-tuned for tasks like summarizations.This annotator is compatible with all the models trained/fine-tuned by using
BartForConditionalGeneration
for PyTorch orTFBartForConditionalGeneration
for TensorFlow models in HuggingFace 🤗The abstract explains that Bart uses a standard seq2seq/machine translation architecture, similar to BERT's bidirectional encoder and GPT's left-to-right decoder. The pretraining task involves randomly shuffling the original sentences and replacing text spans with a single mask token. BART is effective for text generation and comprehension tasks, matching RoBERTa's performance with similar training resources on GLUE and SQuAD. It also achieves new state-of-the-art results on various summarization, dialogue, and question-answering tasks with gains of up to 6 ROUGE.
Zero-Shot for Text Classification by BERT
NEW: Introducing BertForZeroShotClassification annotator for Zero-Shot Text Classification in Spark NLP 🚀. You can use the
BertForZeroShotClassification
annotator for text classification with your labels! 💯Let's see how easy it is to just use any set of labels our trained model has never seen via the
setCandidateLabels()
param:For Zero-Short Multi-class Text Classification:
For Zero-Short Multi-class Text Classification:
⭐🐛 Improvements & Bug Fixes
nerHasNoSchema
param for NerConverter when labels coming from NerDLMOdel and NerCrfModel don't have any schemasetEntityName
paramWordEmbeddingsModel
bug when loading a model from S3 via thecache_folder
configWordEmbeddingsModel
bug failing when it's used withsetEnableInMemoryStorage
set toTrue
and LightPipelineenablePatternRegex
fromEntityRulerApproach
&EntityRulerModel
💾 Models
Spark NLP 4.4.0 comes with more than 4300+ new state-of-the-art pre-trained transformer models in multi-languages.
Featured Models
en
en
en
en
en
nl
zh
The complete list of all 17000+ models & pipelines in 230+ languages is available on Models Hub
📓 New Notebooks
📖 Documentation
Community support
and show off how you use Spark NLP!
Installation
Python
#PyPI pip install spark-nlp==4.4.0
Spark Packages
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, and 3.3.x (Scala 2.12):
GPU
Apple Silicon (M1 & M2)
AArch64
Maven
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, and 3.3.x:
spark-nlp-gpu:
spark-nlp-silicon:
spark-nlp-aarch64:
FAT JARs
CPU on Apache Spark 3.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-assembly-4.4.0.jar
GPU on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-gpu-assembly-4.4.0.jar
M1 on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-silicon-assembly-4.4.0.jar
AArch64 on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-aarch64-assembly-4.4.0.jar
What's Changed
Contributors
@Cabir40 @bunyamin-polat @danilojsl @dcecchini @Meryem1425 @C-K-Loan @agsfer @maziyarpanahi @jfernandrezj @jsl-builder @DevinTDHa @josejuanmartinez @aymanechilah
Full Changelog: 4.3.2...4.4.0
Beta Was this translation helpful? Give feedback.
All reactions