John Snow Labs Spark-NLP 4.2.3: Improved CoNLLGenerator annotator, new rules parameter in RegexMatcher, new IAnnotation feature for LightPipeline in Scala, and bug fixes #13074
maziyarpanahi
announced in
Announcement
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
📢 Overview
Spark NLP 4.2.3 🚀 comes with new improvements to the
CoNLLGenerator
annotator, a new way to pass rules to theRegexMatcher
annotator, unifying control over a number of columns insetInputCols
between the Scala and Python, new documentation for our newIAnnotation
feature for those who are using Spark NLP in Scala, and bug fixes.Do not forget to visit Models Hub with over 11400+ free and open-source models & pipelines. As always, we would like to thank our community for their feedback, questions, and feature requests. 🎉
⭐ New Features & improvements
CoNLLGenerator
annotatorCoNLLGenerator
annotator when writing to CSV and preserve special char tokenIAnnotation
feature for Scala usersBug Fixes
NotSerializableException
when theWordEmbeddings
annotator is used over the K8s cluster whilesetEnableInMemoryStorage
is set totrue
RegexTokenizer
annotator when it outputs the wrong indexes if the pattern includes splits that are not followed by a spaceCoNLL()
,CoNLLU()
,POS()
, andPubTator()
CoNLLGenerator
annotator where the token has non-int metadataSentencePiece
model's name required forDeBertaForQuestionAnswering
andDeBertaEmbeddings
when importing modelsNaNs
result in some ViTForImageClassification models/pipelines📓 New Notebooks
📖 Documentation
Installation
Python
#PyPI pip install spark-nlp==4.2.3
Spark Packages
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, and 3.3.x (Scala 2.12):
GPU
M1
AArch64
Maven
spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, and 3.3.x:
spark-nlp-gpu:
spark-nlp-m1:
spark-nlp-aarch64:
FAT JARs
CPU on Apache Spark 3.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-assembly-4.2.3.jar
GPU on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-gpu-assembly-4.2.3.jar
M1 on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-m1-assembly-4.2.3.jar
AArch64 on Apache Spark 3.0.x/3.1.x/3.2.x/3.3.x: https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/jars/spark-nlp-aarch64-assembly-4.2.3.jar
What's Changed
Full Changelog: 4.2.2...4.2.3
Beta Was this translation helpful? Give feedback.
All reactions