Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

OpenNMT / CTranslate2 Public

Notifications You must be signed in to change notification settings
Fork 305
Star 3.4k

Code
Issues 166
Pull requests 27
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Releases: OpenNMT/CTranslate2

Releases · OpenNMT/CTranslate2

CTranslate2 1.5.1

06 Feb 16:25

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.5.1

CTranslate2 1.5.1

Fixes and improvements

Fix INT8 translation on CPU with vocabulary map

Assets 2

Loading

All reactions

CTranslate2 1.5.0

06 Feb 15:20

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.5.0

CTranslate2 1.5.0

New features

[C++] Add max_batch_size translation options for single translators

Fixes and improvements

Improve INT8 performance on CPU
Enable INT8 support on default Intel MKL build
Simplify project dependencies:
- Replace boost::program_options with cxxopts for client options
- Include header-only dependencies as Git submodules (cxxopts, cub, and thrust)
- Remove MKL-DNN
Harmonize Python/C++ default values:
- [Python] Change default beam size from 4 to 2
- [C++] Load models on the CPU by default

Assets 2

Loading

All reactions

CTranslate2 1.4.0

21 Jan 08:54

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.4.0

CTranslate2 1.4.0

New features

Publish a package on PyPI (without GPU support)
Add method to convert OpenNMT-tf models directly from a dictionary of variables
Return statistics from Python method Translator.translate_file
Add set_model methods to support changing models without creating a new Translator
Add a contains_model function to check whether a directory could contain a CTranslate2 model

Assets 2

Loading

All reactions

CTranslate2 1.3.0

14 Jan 17:31

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.3.0

CTranslate2 1.3.0

New features

Support random sampling (see the sampling_topk and sampling_temperature translation options)
CT2_CUDA_CACHING_ALLOCATOR_CONFIG environment variable to configure the CUDA caching allocator

Fixes and improvements

Fix incorrect translations on Windows due to incompatibility between the compiler OpenMP and Intel OpenMP
Release cuDNN/cuBLAS/TensorRT handles on thread exit when destroying a TranslatorPool
Remove use of --{start,end}-group compiler options when compiling on Mac OS
Update Intel MKL to 2020.0 in Docker images
Load vocabulary assets for SavedModel exported with OpenNMT-tf 2.5 and above

Assets 2

Loading

All reactions

CTranslate2 1.2.3

11 Dec 10:26

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.2.3

CTranslate2 1.2.3

Fixes and improvements

Improve translator robustness on empty batch and inputs
Speed optimization for LayerNorm
Check vocabulary size when converting OpenNMT-tf models
Add more samples in the execution profiling output which now supports nested functions

Assets 2

Loading

All reactions

CTranslate2 1.2.2

25 Nov 13:23

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.2.2

CTranslate2 1.2.2

Fixes and improvements

Fix PositionEncoder internal state that was shared with other instances on the same thread
Replace Boost.Python by pybind11
Include a Python source distribution in the Docker images

Assets 2

Loading

All reactions

CTranslate2 1.2.1

06 Nov 12:19

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.2.1

CTranslate2 1.2.1

Fixes and improvements

Avoid copying decoder states when possible to improve decoding performance (10% to 20% faster)
Fix execution profiling on GPU (device was not synchronized before measuring the time)
Include Mul operation in profiling report
Add a Python 3 wheel in Ubuntu Docker images

Assets 2

Loading

All reactions

CTranslate2 1.2.0

28 Oct 11:03

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.2.0

CTranslate2 1.2.0

New features

Accept Transformer models with custom number of layers and heads
--log-profiling client option to profile ops execution

Fixes and improvements

Fix conversion error for models having 2 different weights with the same values
Fix invalid MKL function override after a refactoring
Add more information and context to several error messages

Assets 2

Loading

All reactions

CTranslate2 1.1.0

18 Oct 14:29

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.1.0

CTranslate2 1.1.0

New features

New Docker images: latest-ubuntu16-gpu, latest-ubuntu18, latest-ubuntu18-gpu
Support OpenNMT-tf Transformer models with shared embeddings
Update to TensorRT 6
Make OpenMP runtime configurable

Fixes and improvements

Reduce the size of models with shared weights on disk and in memory
Shared words vocabulary is no longer duplicated on disk and in memory
Improve performance of translation with a vocabulary map on GPU
Statically link against Intel MKL
Remove some implementation details from public headers

Assets 2

Loading

All reactions

CTranslate2 1.0.1

08 Oct 11:46

guillaumekln

Compare

Choose a tag to compare

Loading

CTranslate2 1.0.1

CTranslate2 1.0.1

Fixes and improvements

Fix loading of newer OpenNMT-py models
Promote FP16 to FP32 in model converter scripts
Improve INT8 performance on CPU and GPU
Improve performance on GPU by fusing the layer normalization operation x * gamma + beta
Enable INT8 and INT16 computation on all platforms with Intel MKL 2019.5 and above

Assets 2

Loading

All reactions

Previous 1 2 … 9 10 11 12 13 Next

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.