SpokenNLP: The official repository for codebases on a wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.

- [2024-02-05]: SLD was accepted by ICASSP 2024. It introduces SLD, a novel approach that applies a KL divergence loss with smoothed labels on speech tokens for discrete-token-based ASR.
- [2023-10-23]: Ditto was accepted by EMNLP 2023. It introduces Ditto, a learning-free approach that uses model-based importance estimations to weight words and compute sentence embeddings from pre-trained model representations.
- [2023-10-07]: Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling was accepted by EMNLP 2023. It enhances the pretrained language model's ability to capture coherence from both structure and similarity perspectives to further improve topic segmentation performance.
- [2023-05-22]: PoNet was submitted to the Hugging Face Hub. PoNet can now be used directly through the Transformers library.
- [2022-12-02]: alimeeting4mug released the official baseline system codebase for the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)!
- [2022-02-24]: MDERank was accepted by Findings of ACL 2022. It is a Masked Document Embedding Rank approach for unsupervised keyphrase extraction, which outperforms state-of-the-art unsupervised keyphrase extraction approaches, especially on long documents.
- [2022-01-21]: PoNet was accepted by ICLR 2022. It is a novel Pooling Network (PoNet) for token mixing in long sequences with linear complexity, which achieves a good balance among transfer learning capability, accuracy, and complexity for long sequence modeling. Models are released at Modelscope (English and Chinese).
- [2021-09-11]: SeqModel was accepted by IEEE ASRU 2021. It is a sequence model with a self-adaptive sliding window for efficient spoken document segmentation. A new Chinese Wikipedia-based document segmentation dataset, Wiki-zh, was released. Models are released at Modelscope (English and Chinese).
- [2019-02-28]: JointBERT was proposed for joint intent classification and slot filling with BERT. A third-party PyTorch implementation of JointBERT is available.
- [2018-10-17]: ESIM ranked top on both datasets of the DSTC7 Noetic End-to-end Response Selection track!

SpokenNLP is released under the Apache License 2.0. This project contains various third-party components under other open source licenses.