Models hub (#14335)
Co-authored-by: ahmedlone127 <[email protected]>

* Add model 2024-02-01-bert_zero_shot_classifier_mnli_xx (#14157)

Co-authored-by: ahmedlone127 <[email protected]>

* Add model 2024-01-20-mpnet_base_question_answering_squad2_en (#14146)

Co-authored-by: DevinTDHa <[email protected]>

* 2024-02-11-bge_m3_xx (#14170)

* Add model 2024-02-11-bge_m3_xx

* Update 2024-02-11-bge_m3_xx.md

---------

Co-authored-by: ahmedlone127 <[email protected]>
Co-authored-by: Maziyar Panahi <[email protected]>

* 2024-02-16-distil_asr_whisper_small_en (#14176)

* Add model 2024-02-16-distil_asr_whisper_small_en

* Add model 2024-02-25-distil_asr_whisper_medium_en

* Add model 2024-02-26-distil_asr_whisper_large_v2_en

---------

Co-authored-by: ahmedlone127 <[email protected]>

* 2024-04-04-mpnet_embeddings_biolord_2023_c_en (#14226)

* Add model 2024-04-04-mpnet_embeddings_biolord_2023_c_en

* Update 2024-04-04-mpnet_embeddings_biolord_2023_c_en.md

---------

Co-authored-by: ahmet-mesut <[email protected]>
Co-authored-by: Ahmet Mesut BİROL <[email protected]>

* Add model 2024-04-05-uae_large_v1_en (#14229)

Co-authored-by: DevinTDHa <[email protected]>

* Add model 2024-04-22-mpnet_embeddings_biolord_2023_en (#14240)

Co-authored-by: akrztrk <[email protected]>

* Add model 2024-04-22-mpnet_embeddings_biolord_2023_en (#14241)

Co-authored-by: akrztrk <[email protected]>

* 2024-05-06-deepa_xlmroberta_ner_large_en_panx_en (#14246)

* Add model 2024-05-06-deepa_xlmroberta_ner_large_en_panx_en

* Add model 2024-05-06-deepa_xlmroberta_ner_large_panx_dataset_en

---------

Co-authored-by: SaiDeepaPeri <[email protected]>

* 2024-06-10-test25_en (#14326)

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_cyycyy_en

* Add model 2024-06-11-clinico_xlm_roberta_large_finetuned_augmented1_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_cyycyy_pipeline_en

* Add model 2024-06-11-tner_xlm_roberta_base_ontonotes5_switchboard_earnings21_normalized_en

* Add model 2024-06-11-tner_xlm_roberta_base_ontonotes5_switchboard_earnings21_normalized_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_abdus_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_abdus_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_arabic_yazannasser_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_jamie613_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_arabic_yazannasser_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_jamie613_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_udon3_en

* Add model 2024-06-11-mongolian_davlan_xlm_roberta_base_ner_hrl_mn

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_obong_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_udon3_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_obong_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_cataluna84_pipeline_en

* Add model 2024-06-11-mongolian_davlan_xlm_roberta_base_ner_hrl_pipeline_mn

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_hitakura_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_hitakura_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_cataluna84_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_myasa_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_chris_choi_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_gogd_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_chris_choi_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_yezune_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_myasa_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_yezune_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_gogd_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_handun_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_handun_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_gogd_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_gogd_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_dochee_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_yong_sik_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_likejazz_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_dochee_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_yong_sik_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_arthur_75_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_likejazz_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_songys_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_arthur_75_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_mooface_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_mooface_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_team_nave_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_team_nave_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_jjglilleberg_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_jjglilleberg_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_songys_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_chris_choi_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_sungkwangjoong_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_andrew45_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_chris_choi_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_gogd_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_sungkwangjoong_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_andrew45_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_gogd_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_wilcomply_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_wilcomply_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_k3lana_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_yyabuki_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_k3lana_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_pnax_german_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_pnax_german_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_robinschaefer_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_robinschaefer_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_likejazz_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_yyabuki_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_amitjain171980_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_likejazz_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_leotunganh_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_leotunganh_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_amitjain171980_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_arabic_zaid_33_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_sunwooooong_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_arabic_zaid_33_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_sunwooooong_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_chris_choi_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_qilin1_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_qilin1_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_chris_choi_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_karolk_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_karolk_pipeline_en

* Add model 2024-06-11-xlm_r_galen_distemist_es

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_maxnet_en

* Add model 2024-06-11-xlm_r_galen_distemist_pipeline_es

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_maxnet_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_sungkwangjoong_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_ysige_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_sungkwangjoong_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_anniepyim_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_anniepyim_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_amartyobanerjee_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_heerak_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_amartyobanerjee_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_leotunganh_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_ysige_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_leotunganh_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_heerak_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_aiventurer_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_malduwais_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_chris_choi_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_malduwais_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_pie_en

* Add model 2024-06-11-xlm_roberta_base_pie_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_aiventurer_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_ducdh1210_en

* Add model 2024-06-11-xlm_norwegian_i_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_aiventurer_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_chris_choi_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_aiventurer_pipeline_en

* Add model 2024-06-11-xlm_norwegian_i_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_ducdh1210_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_philosucker_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_mj03_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_maxnet_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_team_nave_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_mj03_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_koroku_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_maxnet_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_philosucker_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_team_nave_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_koroku_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_backdrive_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_inniok_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_ahmad_alismail_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_inniok_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_kiechu_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_backdrive_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_kiechu_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_ahmad_alismail_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_hitoshinagaoka_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_shinta0615_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_kenhoffman_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_hitoshinagaoka_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_all_kenhoffman_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_shinta0615_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_h_radiolo_tech_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_h_radiolo_tech_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_udon3_en

* Add model 2024-06-11-clinico_xlm_roberta_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_udon3_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_kenhoffman_en

* Add model 2024-06-11-clinico_xlm_roberta_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_kenhoffman_pipeline_en

* Add model 2024-06-11-gal_xlm_r_gl

* Add model 2024-06-11-gal_xlm_r_pipeline_gl

* Add model 2024-06-11-afriberta_large_hausa_5e_5_en

* Add model 2024-06-11-afriberta_large_hausa_5e_5_pipeline_en

* Add model 2024-06-11-angela_punc_untranslated_eval_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_khadija267_en

* Add model 2024-06-11-angela_punc_untranslated_eval_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_100yen_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_100yen_pipeline_en

* Add model 2024-06-11-afriberta_large_finetuned_hausa_2e_4_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_donaldyy_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_bobojjhh_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_donaldyy_pipeline_en

* Add model 2024-06-11-afriberta_large_finetuned_hausa_2e_4_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_khadija267_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_bobojjhh_pipeline_en

* Add model 2024-06-11-enlm_roberta_conll2003_final_stemmed_en

* Add model 2024-06-11-enlm_roberta_conll2003_final_stemmed_pipeline_en

* Add model 2024-06-11-xlm_r_galen_socialdisner_es

* Add model 2024-06-11-arabnizer_xlmr_panx_arabic_ar

* Add model 2024-06-11-xlm_r_galen_socialdisner_pipeline_es

* Add model 2024-06-11-arabnizer_xlmr_panx_arabic_pipeline_ar

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_100yen_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_jamie613_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_100yen_pipeline_en

* Add model 2024-06-11-spa_enpt_xlm_r_es

* Add model 2024-06-11-spa_enpt_xlm_r_pipeline_es

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_jamie613_en

* Add model 2024-06-11-afriberta_base_hausa_5e_5_en

* Add model 2024-06-11-afriberta_base_hausa_5e_5_pipeline_en

* Add model 2024-06-11-tner_xlm_roberta_base_ontonotes5_switchboard_normalized_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_hash1360_en

* Add model 2024-06-11-tner_xlm_roberta_base_ontonotes5_switchboard_normalized_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_hash1360_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_alkampfer_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_alkampfer_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_matovu_ronald_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_matovu_ronald_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_kenhoffman_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_kenhoffman_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_jzwk_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_ajit_transformer_en

* Add model 2024-06-11-cat_ner_xlmr_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_ajit_transformer_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_fraisier_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_fraisier_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_jzwk_pipeline_en

* Add model 2024-06-11-cat_ner_xlmr_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_patnelt60_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_italian_patnelt60_pipeline_en

* Add model 2024-06-11-afriberta_base_finetuned_hausa_2e_3_en

* Add model 2024-06-11-afriberta_base_finetuned_hausa_2e_3_pipeline_en

* Add model 2024-06-11-ter_class_5e_5_hausa_en

* Add model 2024-06-11-ter_class_5e_5_hausa_pipeline_en

* Add model 2024-06-11-norwegian_delete_5e_5_hausa_en

* Add model 2024-06-11-norwegian_delete_5e_5_hausa_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_sbpark_en

* Add model 2024-06-11-xlm_roberta_panx_uzbek_en

* Add model 2024-06-11-xlm_roberta_panx_uzbek_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_french_sbpark_pipeline_en

* Add model 2024-06-11-flipped_2e_4_hausa_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_hanlforever_en

* Add model 2024-06-11-xlmr_base_finetuned_hausa_2e_4_en

* Add model 2024-06-11-flipped_2e_4_hausa_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_jakobbrunner_en

* Add model 2024-06-11-xlmr_base_finetuned_hausa_2e_4_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_songys_en

* Add model 2024-06-11-gal_sayula_popoluca_iw_catalan_galician_pipeline_en

* Add model 2024-06-11-gal_sayula_popoluca_iw_catalan_galician_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_jakobbrunner_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_songys_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_hanlforever_pipeline_en

* Add model 2024-06-11-angela_untranslated_shuffle_eval_en

* Add model 2024-06-11-unfiltered_norwegian_delete_hausa_en

* Add model 2024-06-11-unfiltered_norwegian_delete_hausa_pipeline_en

* Add model 2024-06-11-afro_xlmr_base_hausa_5e_5_en

* Add model 2024-06-11-gal_ensp_xlm_r_gl

* Add model 2024-06-11-gal_ensp_xlm_r_pipeline_gl

* Add model 2024-06-11-afro_xlmr_base_hausa_5e_5_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_henryjiang_en

* Add model 2024-06-11-angela_shuffle_punc_eval_en

* Add model 2024-06-11-angela_shuffle_punc_eval_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_ryatora_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_french_henryjiang_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_ryatora_en

* Add model 2024-06-11-angela_untranslated_shuffle_eval_pipeline_en

* Add model 2024-06-11-afro_xlmr_mini_finetuned_hausa_2e_4_en

* Add model 2024-06-11-angela_shuffle_diacritics_eval_en

* Add model 2024-06-11-angela_shuffle_diacritics_eval_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_jjglilleberg_en

* Add model 2024-06-11-aligned_source_5e_5_en

* Add model 2024-06-11-afro_xlmr_mini_finetuned_hausa_2e_4_pipeline_en

* Add model 2024-06-11-aligned_source_5e_5_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_english_jjglilleberg_pipeline_en

* Add model 2024-06-11-angela_punctuation_test_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_inniok_en

* Add model 2024-06-11-angela_punctuation_test_pipeline_en

* Add model 2024-06-11-xlm_roberta_base_finetuned_panx_german_inniok_pipeline_en

* Add model 2024-06-11-afro_xlmr_mini_finetuned_igbo_en

* Add model 2024-06-11-afro_xlmr_mini_finetuned_igbo_pipeline_en

* Add model 2024-06-11-angela_diacritics_shuffle_eval_en

* Add model 2024-06-11-angela_untranslated_entities_regular_eval_en

* Add model 2024-06-11-angela_diacritics_shuffle_eval_pipeline_en

* Add model 2024-06-11-angela_untranslated_entities_regular_eval_pipeline_en

* Add model 2024-06-12-sent_roberta_base_en

---------

Co-authored-by: ahmedlone127 <[email protected]>

* 2024-06-13-bge_base_english_sec10k_embed_en (#14331)

* Add model 2024-06-13-bge_base_english_sec10k_embed_en

* Add model 2024-06-13-bge_base_english_sec10k_embed_pipeline_en

* Add model 2024-06-13-bge_base_securiti_dataset_1_v7_en

* Add model 2024-06-13-bge_base_securiti_dataset_1_v7_pipeline_en

* Add model 2024-06-13-bge_base_financial_matryoshka_anikulkar_en

* Add model 2024-06-13-bge_base_financial_matryoshka_anikulkar_pipeline_en

* Add model 2024-06-13-bge_base_securiti_dataset_1_v8_en

* Add model 2024-06-13-bge_base_securiti_dataset_1_v8_pipeline_en

* Add model 2024-06-13-bge_base_financial_matryoshka_hritikmore_en

* Add model 2024-06-13-bge_base_financial_matryoshka_hritikmore_pipeline_en

* Add model 2024-06-13-bge_base_financial_matryoshka_thetayne_en

* Add model 2024-06-13-bge_base_financial_matryoshka_thetayne_pipeline_en

* Add model 2024-06-13-xlmroberta_ner_base_finetuned_arman_fa

* Add model 2024-06-13-xlmroberta_ner_base_finetuned_arman_pipeline_fa

* Add model 2024-06-13-xlmroberta_ner_base_finetuned_ner_wolof_wo

* Add model 2024-06-13-xlmroberta_ner_base_finetuned_ner_kinyarwand_rw

* Add model 2024-06-13-xlmroberta_ner_base_bionlp2004_en

---------

Co-authored-by: ahmedlone127 <[email protected]>

---------

Co-authored-by: jsl-models <[email protected]>
Co-authored-by: ahmedlone127 <[email protected]>
Co-authored-by: prabod <[email protected]>
Co-authored-by: DevinTDHa <[email protected]>
Co-authored-by: Devin Ha <[email protected]>
6 people authored Jul 1, 2024
1 parent 6616323 commit 1a4f329
Showing 762 changed files with 62,152 additions and 1 deletion.
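
The entries in the commit message appear to follow a `YYYY-MM-DD-<name>_<lang>` pattern, with pipeline variants carrying a `_pipeline` suffix before the language code. As a minimal sketch, assuming a two-letter language code as in every entry shown here, the list could be parsed like this. `Entry` and `parse_entry` are hypothetical names used for illustration and are not part of Spark NLP.

```python
import re
from typing import NamedTuple, Optional

# Assumed pattern: date, model name, then a two-letter language code.
ENTRY_RE = re.compile(r"^(?P<date>\d{4}-\d{2}-\d{2})-(?P<name>.+)_(?P<lang>[a-z]{2})$")
# Optional trailing pull-request reference such as " (#14326)".
PR_SUFFIX_RE = re.compile(r"\s*\(#\d+\)$")


class Entry(NamedTuple):
    date: str
    name: str          # full name, including any "_pipeline" suffix
    lang: str
    is_pipeline: bool


def parse_entry(line: str) -> Optional[Entry]:
    """Parse one commit-message bullet; return None if it does not match."""
    text = PR_SUFFIX_RE.sub("", line.strip().lstrip("* "))
    if text.startswith("Add model "):
        text = text[len("Add model "):]
    match = ENTRY_RE.match(text)
    if match is None:
        return None
    name = match.group("name")
    return Entry(match.group("date"), name, match.group("lang"), name.endswith("_pipeline"))


lines = [
    "* Add model 2024-06-11-gal_xlm_r_gl",
    "* Add model 2024-06-11-gal_xlm_r_pipeline_gl",
    "* 2024-06-10-test25_en (#14326)",
    "---------",
]
entries = [e for e in map(parse_entry, lines) if e is not None]
for entry in entries:
    print(entry)
```

Separator lines and `Co-authored-by` trailers do not match the pattern and are skipped, so the same function can be run over the whole message.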
@@ -0,0 +1,76 @@
---
layout: model
title: Deepa Panx Model for English
author: SaiDeepaPeri
name: deepa_xlmroberta_ner_large_en_panx
date: 2024-05-06
tags: [en, open_source]
task: Named Entity Recognition
language: en
edition: Spark NLP 4.1.0
spark_version: 3.0
supported: false
annotator: XlmRoBertaForTokenClassification
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Named Entity Recognition model trained on the English PAN-X dataset.

## Predicted Entities



{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/community.johnsnowlabs.com/SaiDeepaPeri/deepa_xlmroberta_ner_large_en_panx_en_4.1.0_3.0_1715017572119.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://community.johnsnowlabs.com/SaiDeepaPeri/deepa_xlmroberta_ner_large_en_panx_en_4.1.0_3.0_1715017572119.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python
documentAssembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

tokenizer = Tokenizer() \
    .setInputCols(["document"]) \
    .setOutputCol("token")

token_classifier = XlmRoBertaForTokenClassification.pretrained("deepa_xlmroberta_ner_large_en_panx", "en") \
    .setInputCols(["document", "token"]) \
    .setOutputCol("ner")

ner_converter = NerConverter() \
    .setInputCols(["document", "token", "ner"]) \
    .setOutputCol("ner_chunk")

pipeline = Pipeline(stages=[documentAssembler, tokenizer, token_classifier, ner_converter])

```

</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|deepa_xlmroberta_ner_large_en_panx|
|Compatibility:|Spark NLP 4.1.0+|
|License:|Open Source|
|Edition:|Community|
|Input Labels:|[document, token]|
|Output Labels:|[ner]|
|Language:|en|
|Size:|1.8 GB|
|Case sensitive:|true|
|Max sentence length:|256|
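
The archive name in the download link above seems to encode several of the fields in this table: model name, language, Spark NLP version, Spark version, and a 13-digit number that reads as a Unix timestamp in milliseconds. The sketch below splits such a name under that assumption. `parse_archive_name` is a hypothetical helper, and the field layout is inferred from the links in this commit rather than from a documented convention.

```python
import re
from datetime import datetime, timezone
from typing import Dict

# Assumed layout: <name>_<lang>_<spark-nlp version>_<spark version>_<epoch ms>.zip
ARCHIVE_RE = re.compile(
    r"^(?P<name>.+)_(?P<lang>[a-z]{2})"
    r"_(?P<nlp_version>\d+\.\d+\.\d+)"
    r"_(?P<spark_version>\d+\.\d+)"
    r"_(?P<timestamp_ms>\d{13})\.zip$"
)


def parse_archive_name(url: str) -> Dict[str, str]:
    """Split the last path segment of a download URL into its assumed fields."""
    filename = url.rsplit("/", 1)[-1]
    match = ARCHIVE_RE.match(filename)
    if match is None:
        raise ValueError(f"unrecognized archive name: {filename}")
    fields = match.groupdict()
    # Interpret the trailing number as milliseconds since the Unix epoch (UTC).
    uploaded = datetime.fromtimestamp(int(fields["timestamp_ms"]) / 1000, tz=timezone.utc)
    fields["uploaded_on"] = uploaded.date().isoformat()
    return fields


info = parse_archive_name(
    "s3://community.johnsnowlabs.com/SaiDeepaPeri/"
    "deepa_xlmroberta_ner_large_en_panx_en_4.1.0_3.0_1715017572119.zip"
)
print(info["name"], info["lang"], info["nlp_version"], info["spark_version"], info["uploaded_on"])
```

Read this way, the timestamp for this card falls on 2024-05-06 (UTC), which agrees with the `date` field in the front matter.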
@@ -0,0 +1,78 @@
---
layout: model
title: "Deepa NER XLMRoberta Large Model : deepa_xlmroberta_ner_large_panx"
author: SaiDeepaPeri
name: deepa_xlmroberta_ner_large_panx_dataset
date: 2024-05-06
tags: [en, open_source]
task: Named Entity Recognition
language: en
edition: Spark NLP 4.1.0
spark_version: 3.0
supported: false
annotator: XlmRoBertaForTokenClassification
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Named Entity Recognition model based on XLM-RoBERTa Large.

## Predicted Entities



{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/community.johnsnowlabs.com/SaiDeepaPeri/deepa_xlmroberta_ner_large_panx_dataset_en_4.1.0_3.0_1715028210601.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://community.johnsnowlabs.com/SaiDeepaPeri/deepa_xlmroberta_ner_large_panx_dataset_en_4.1.0_3.0_1715028210601.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python
documentAssembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

# Tokenize on whitespace with a RegexTokenizer
tokenizer = RegexTokenizer() \
    .setInputCols(["document"]) \
    .setOutputCol("token") \
    .setPattern("\\s+")

# Load the token classifier published with this card
token_classifier = XlmRoBertaForTokenClassification.pretrained("deepa_xlmroberta_ner_large_panx_dataset", "en") \
    .setInputCols(["document", "token"]) \
    .setOutputCol("ner")

ner_converter = NerConverter() \
    .setInputCols(["document", "token", "ner"]) \
    .setOutputCol("ner_chunk")

pipeline = Pipeline(stages=[documentAssembler, tokenizer, token_classifier, ner_converter])

```

</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|deepa_xlmroberta_ner_large_panx_dataset|
|Compatibility:|Spark NLP 4.1.0+|
|License:|Open Source|
|Edition:|Community|
|Input Labels:|[document, token]|
|Output Labels:|[ner]|
|Language:|en|
|Size:|1.8 GB|
|Case sensitive:|true|
|Max sentence length:|256|
2 changes: 1 addition & 1 deletion docs/_posts/ahmedlone127/2024-02-11-bge_m3_xx.md
@@ -68,7 +68,7 @@ val sentencerDL = SentenceDetectorDLModel.pretrained("sentence_detector_dl", "xx
.setOutputCol("sentence")

val embeddings = XlmRoBertaSentenceEmbeddings
-.pretrained("bge_m3", "xx")
+.pretrained("bge_m3 ", "xx")
.setInputCols(Array("sentence"))
.setOutputCol("embeddings")
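
The two `.pretrained(...)` lines in this hunk differ, as rendered here, only by whitespace inside the model-name string. A check along the following lines could flag that kind of discrepancy before a card is published. This is a sketch with hypothetical helper names (`pretrained_calls`, `check_names`), and it assumes the expected name can be taken from the post file name.

```python
import re
from typing import List, Tuple

# Matches .pretrained("<name>", "<lang>") with optional whitespace around the arguments.
PRETRAINED_RE = re.compile(r'\.pretrained\(\s*"([^"]*)"\s*,\s*"([^"]*)"\s*\)')


def pretrained_calls(snippet: str) -> List[Tuple[str, str]]:
    """Return every (name, lang) pair passed to .pretrained(...) in a snippet."""
    return PRETRAINED_RE.findall(snippet)


def check_names(snippet: str, expected_name: str) -> List[str]:
    """Describe each model name that has stray whitespace or differs from the expected one."""
    problems = []
    for name, _lang in pretrained_calls(snippet):
        if name != name.strip():
            problems.append(f"stray whitespace in {name!r}")
        elif name != expected_name:
            problems.append(f"{name!r} does not match {expected_name!r}")
    return problems


hunk = '''
.pretrained("bge_m3", "xx")
.pretrained("bge_m3 ", "xx")
'''
for problem in check_names(hunk, expected_name="bge_m3"):
    print(problem)
```

The same check also reports names that are clean but differ from the expected one, which covers a snippet that loads a model other than the one the card describes.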

@@ -0,0 +1,87 @@
---
layout: model
title: English baai_bge_base_english_nowr_1_1 BGEEmbeddings from alexakkol
author: John Snow Labs
name: baai_bge_base_english_nowr_1_1
date: 2024-06-10
tags: [en, open_source, onnx, embeddings, bge]
task: Embeddings
language: en
edition: Spark NLP 5.4.0
spark_version: 3.0
supported: true
engine: onnx
annotator: BGEEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained BGEEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `baai_bge_base_english_nowr_1_1` is an English model originally trained by alexakkol.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/baai_bge_base_english_nowr_1_1_en_5.4.0_3.0_1718060836858.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/baai_bge_base_english_nowr_1_1_en_5.4.0_3.0_1718060836858.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python

documentAssembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

embeddings = BGEEmbeddings.pretrained("baai_bge_base_english_nowr_1_1","en") \
    .setInputCols(["document"]) \
    .setOutputCol("embeddings")

pipeline = Pipeline().setStages([documentAssembler, embeddings])
data = spark.createDataFrame([["I love spark-nlp"]]).toDF("text")
pipelineModel = pipeline.fit(data)
pipelineDF = pipelineModel.transform(data)

```
```scala

val documentAssembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("document")

val embeddings = BGEEmbeddings.pretrained("baai_bge_base_english_nowr_1_1","en")
    .setInputCols(Array("document"))
    .setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(documentAssembler, embeddings))
val data = Seq("I love spark-nlp").toDS.toDF("text")
val pipelineModel = pipeline.fit(data)
val pipelineDF = pipelineModel.transform(data)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|baai_bge_base_english_nowr_1_1|
|Compatibility:|Spark NLP 5.4.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[document]|
|Output Labels:|[bge]|
|Language:|en|
|Size:|381.8 MB|

## References

https://huggingface.co/alexakkol/BAAI-bge-base-en-nowr-1-1
@@ -0,0 +1,69 @@
---
layout: model
title: English baai_bge_base_english_nowr_1_1_pipeline pipeline BGEEmbeddings from alexakkol
author: John Snow Labs
name: baai_bge_base_english_nowr_1_1_pipeline
date: 2024-06-10
tags: [en, open_source, pipeline, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.4.0
spark_version: 3.0
supported: true
annotator: PipelineModel
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained BGEEmbeddings, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `baai_bge_base_english_nowr_1_1_pipeline` is an English model originally trained by alexakkol.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/baai_bge_base_english_nowr_1_1_pipeline_en_5.4.0_3.0_1718060870292.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/baai_bge_base_english_nowr_1_1_pipeline_en_5.4.0_3.0_1718060870292.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python

pipeline = PretrainedPipeline("baai_bge_base_english_nowr_1_1_pipeline", lang = "en")
annotations = pipeline.transform(df)

```
```scala

val pipeline = new PretrainedPipeline("baai_bge_base_english_nowr_1_1_pipeline", lang = "en")
val annotations = pipeline.transform(df)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|baai_bge_base_english_nowr_1_1_pipeline|
|Type:|pipeline|
|Compatibility:|Spark NLP 5.4.0+|
|License:|Open Source|
|Edition:|Official|
|Language:|en|
|Size:|381.8 MB|

## References

https://huggingface.co/alexakkol/BAAI-bge-base-en-nowr-1-1

## Included Models

- DocumentAssembler
- BGEEmbeddings