diff --git a/docs/_posts/egenc/2022-01-04-check_spelling_dl_en.md b/docs/_posts/egenc/2022-01-04-check_spelling_dl_en.md new file mode 100644 index 00000000000000..918bdc4e054e78 --- /dev/null +++ b/docs/_posts/egenc/2022-01-04-check_spelling_dl_en.md @@ -0,0 +1,101 @@ +--- +layout: model +title: Spell Checker in English Text +author: John Snow Labs +name: check_spelling_dl +date: 2022-01-04 +tags: [open_source, en, spell, dl] +task: Spell Check +language: en +edition: Spark NLP 3.3.0 +spark_version: 2.4 +supported: true +article_header: + type: cover +use_language_switcher: "Python-Scala-Java" +--- + +## Description + +Spell Checker is a sequence-to-sequence pipeline that detects and corrects spelling errors in your input text. It's based on Levenshtein Automaton for generating candidate corrections and a Neural Language Model for ranking corrections. You can download the pretrained pipeline that comes ready to use. + +{:.btn-box} + + +[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/check_spelling_dl_en_3.3.0_2.4_1641301117763.zip){:.button.button-orange.button-orange-trans.arr.button-icon} + +## How to use + + + +
+{% include programmingLanguageSelectScalaPythonNLU.html %} +```python +from sparknlp.pretrained import PretrainedPipeline + +pipeline = PretrainedPipeline('check_spelling_dl', lang='en') +result = pipeline.fullAnnotate("During the summer we have the hottest ueather. I have a black ueather jacket, so nice.I intrduce you to my sister, she is called ueather.") +``` +```scala +import com.johnsnowlabs.nlp.pretrained.PretrainedPipeline + +val pipeline = new PretrainedPipeline("check_spelling_dl", lang = "en") +val result = pipeline.fullAnnotate("During the summer we have the hottest ueather. I have a black ueather jacket, so nice.I intrduce you to my sister, she is called ueather.") +``` +
+ +## Results + +```bash +[('During', 'During'), + ('the', 'the'), + ('summer', 'summer'), + ('we', 'we'), + ('have', 'have'), + ('the', 'the'), + ('hottest', 'hottest'), + ('ueather', 'weather'), + ('.', '.'), + ('I', 'I'), + ('have', 'have'), + ('a', 'a'), + ('black', 'black'), + ('ueather', 'leather'), + ('jacket', 'jacket'), + (',', ','), + ('so', 'so'), + ('nice', 'nice'), + ('.', '.'), + ('I', 'I'), + ('intrduce', 'introduce'), + ('you', 'you'), + ('to', 'to'), + ('my', 'my'), + ('sister', 'sister'), + (',', ','), + ('she', 'she'), + ('is', 'is'), + ('called', 'called'), + ('ueather', 'Heather'), + ('.', '.')] +``` + +{:.model-param} +## Model Information + +{:.table-model} +|---|---| +|Model Name:|check_spelling_dl| +|Type:|pipeline| +|Compatibility:|Spark NLP 3.3.0+| +|License:|Open Source| +|Edition:|Official| +|Language:|en| +|Size:|118.1 MB| + +## Included Models + +- DocumentAssembler +- SentenceDetectorDLModel +- TokenizerModel +- ContextSpellCheckerModel \ No newline at end of file