reaches the max cache size: #51
Backend impacted
The MLX implementation
Operating system
Mac OS X
Hardware
Metal with MLX
Description
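Running the command below starts the server, but once a browser connection is accepted the encoder thread fails repeatedly with a "narrow invalid args" error:
python -m moshi_mlx.local_web -q 4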
(moshi) moshi % python -m moshi_mlx.local_web -q 4
[Info] [SERVER] loading text tokenizer /Users/.cache/huggingface/hub/models--kyutai--moshiko-mlx-q4/snapshots/18e4df760a34dd5977a34517d7d1580e07acbb2f1/tokenizer_spm_32k_3.model
[Info] [SERVER] loading weights /Users/.cache/huggingface/hub/models--kyutai--moshiko-mlx-q4/snapshots/18e4df760a34dd5977a34517d7d1580e07acbb2f1/model.q4.safetensors
[Info] [SERVER] weights loaded
[Info] [SERVER] model warmed up
[Info] [SERVER] connected!
[Info] [CLIENT] received 'start' from server, starting...
[Info] retrieving the static content
[Info] serving static content from /Users/.cache/huggingface/hub/models--kyutai--moshi-artifacts/snapshots/8481e95f73827e4e70ac7311c12b0be099276182/dist
[Info] listening to http://localhost:8998
[Info] opening browser at http://localhost:8998
[Info] accepted connection
error in encoder thread narrow invalid args start + len > dim_len: [4096, 32], dim: 0, start: 4096, len:2
error in encoder thread narrow invalid args start + len > dim_len: [4096, 32], dim: 0, start: 4096, len:2
error in encoder thread narrow invalid args start + len > dim_len: [4096, 32], dim: 0, start: 4096, len:2
error in encoder thread narrow invalid args start + len > dim_len: [4096, 32], dim: 0, start: 4096, len:2
error in encoder thread narrow invalid args start + len > dim_len: [4096, 32], dim: 0, start: 4096, len:2
error in encoder thread narrow invalid args start + len > dim_len: [4096, 32], dim: 0, start: 4096, len:2
| [LAG]error in encoder thread narrow invalid args start + len > dim_len: [4096, 32], dim: 0, start: 4096, len:2
[LAG]error in encoder thread narrow invalid args start + len > dim_len: [4096, 32], dim: 0, start: 4096, len:2
[LAG]error in encoder thread narrow invalid args start + len > dim_len: [4096, 32], dim: 0, start: 4096, len:2
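For readers unfamiliar with the message, the condition it reports (start + len > dim_len) can be checked by hand with the values from the log. The sketch below is only an illustration in plain Python: `narrow_is_valid` is a hypothetical helper, not a function from moshi_mlx or MLX, and it does not explain why the start index reached 4096.

```python
def narrow_is_valid(shape, dim, start, length):
    """Return True when the slice [start, start + length) fits inside shape[dim].

    Hypothetical helper mirroring the condition printed in the error message.
    """
    return start >= 0 and length >= 0 and start + length <= shape[dim]


# Values copied from the log: shape [4096, 32], dim 0, start 4096, len 2.
print(narrow_is_valid((4096, 32), 0, 4096, 2))  # False: 4096 + 2 > 4096

# The last slice of length 2 that still fits starts at 4094.
print(narrow_is_valid((4096, 32), 0, 4094, 2))  # True: 4094 + 2 == 4096
```

In the log, the start index already equals the full length of dimension 0, so any further slice of 2 rows falls past the end of the buffer.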
Extra information
$ pip install moshi_mlx
$ python -m moshi_mlx.local_web -q 4
Environment
System Information:
Hardware Information: