Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update wyoming-whisper to 2.3.0 #3848

Merged
merged 2 commits into from
Dec 3, 2024

Conversation

duhow
Copy link
Contributor

@duhow duhow commented Dec 1, 2024

Bump update from rhasspy/wyoming-faster-whisper#40

Summary by CodeRabbit

  • New Features

    • Introduced support for the new turbo model for enhanced processing speed.
  • Version Updates

    • Updated wyoming-whisper package to version 2.3.0.
    • Updated faster-whisper package to version 1.1.0.
  • Documentation

    • Changelog updated to reflect new version entries and features.
    • Documentation updated to include the new turbo model option and clarifications on configuration options.

Copy link
Contributor

coderabbitai bot commented Dec 1, 2024

📝 Walkthrough
📝 Walkthrough

Walkthrough

The pull request introduces version updates for the wyoming-whisper and faster-whisper packages, bumping them to 2.3.0 and 1.1.0, respectively. It also adds support for a new model called turbo, aimed at improving processing speed. The changelog, build configuration, and main configuration files have been updated to reflect these changes, including the modification of the model options in the configuration file.

Changes

File Change Summary
whisper/CHANGELOG.md Added version entry for 2.3.0, updated wyoming-whisper to 2.3.0, updated faster-whisper to 1.1.0, added support for turbo model.
whisper/build.yaml Updated WYOMING_WHISPER_VERSION from 2.2.0 to 2.3.0.
whisper/config.yaml Updated version from 2.2.0 to 2.3.0, modified model options to include turbo in the schema.
whisper/DOCS.md Added new model option turbo in the documentation for the Whisper add-on.

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai or @coderabbitai title anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 5d9cb42 and 92ce193.

📒 Files selected for processing (3)
  • whisper/CHANGELOG.md (1 hunks)
  • whisper/build.yaml (1 hunks)
  • whisper/config.yaml (2 hunks)
✅ Files skipped from review due to trivial changes (1)
  • whisper/CHANGELOG.md
🧰 Additional context used
📓 Path-based instructions (2)
whisper/build.yaml (6)

Pattern */**(html|markdown|md): - For instructional content in documentation, use a direct and authoritative tone. Avoid expressions of politeness such as 'may' or 'please', and ensure the goal of the instruction is fronted.

  • Apply the Microsoft Style Guide to ensure documentation maintains clarity and conciseness.
  • In step-by-step instructions, front the location phrase in the instructional sentence.
  • In step-by-step instructions, front the 'goal' in the instructional sentence.
  • In step-by-step instructions, if in doubt what to front, front the 'goal' before the location phrase in the instructional sentence.
  • do not hyphenate terms like 'top-right' or 'bottom-left' with 'corner'

Pattern */**(html|markdown|md): - Use bold to mark UI strings.

  • If "" are used to mark UI strings, replace them by bold.

Pattern */**(html|markdown|md): - Be brief in your replies and don't add fluff like "thank you for..." and "Please let me know if"


Pattern */**(html|markdown|md): - Use sentence-style capitalization also in headings.


Pattern */**(html|markdown|md): do not comment on HTML used for icons


Pattern */**(html|markdown|md): Avoid flagging inline HTML for embedding videos in future reviews for this repository.

whisper/config.yaml (6)

Pattern */**(html|markdown|md): - For instructional content in documentation, use a direct and authoritative tone. Avoid expressions of politeness such as 'may' or 'please', and ensure the goal of the instruction is fronted.

  • Apply the Microsoft Style Guide to ensure documentation maintains clarity and conciseness.
  • In step-by-step instructions, front the location phrase in the instructional sentence.
  • In step-by-step instructions, front the 'goal' in the instructional sentence.
  • In step-by-step instructions, if in doubt what to front, front the 'goal' before the location phrase in the instructional sentence.
  • do not hyphenate terms like 'top-right' or 'bottom-left' with 'corner'

Pattern */**(html|markdown|md): - Use bold to mark UI strings.

  • If "" are used to mark UI strings, replace them by bold.

Pattern */**(html|markdown|md): - Be brief in your replies and don't add fluff like "thank you for..." and "Please let me know if"


Pattern */**(html|markdown|md): - Use sentence-style capitalization also in headings.


Pattern */**(html|markdown|md): do not comment on HTML used for icons


Pattern */**(html|markdown|md): Avoid flagging inline HTML for embedding videos in future reviews for this repository.

🔇 Additional comments (2)
whisper/config.yaml (2)

2-2: LGTM: Version update matches PR objective

Version update to 2.3.0 is consistent with build.yaml.


22-22: Verify impact of new 'turbo' model addition

The model list has been updated to include the new 'turbo' model while maintaining all existing options. This change appears safe as it:

  • Preserves backward compatibility by retaining all existing models
  • Adds the new option without breaking existing configurations
✅ Verification successful

Let me gather more information about the usage and implementation of the turbo model.


Let me try a different approach to verify the model implementation.


Let me check the Whisper-specific documentation and configuration handling.


Verified: Turbo model addition is properly documented and integrated

The addition of the 'turbo' model is safe and well-integrated because:

  • It's properly documented in CHANGELOG.md (version 2.3.0): "Supports model turbo for faster processing"
  • The model is correctly added to the configuration schema in config.yaml
  • The change maintains backward compatibility with all existing models
  • The default model remains as 'tiny-int8', so existing configurations won't be affected
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check if the PR referenced in wyoming-faster-whisper#40 documents the turbo model
gh api \
  -H "Accept: application/vnd.github+json" \
  repos/rhasspy/wyoming-faster-whisper/pulls/40 | \
  jq -r '.body' | grep -i turbo

# Check for any existing configurations using the default model
rg -l "model: tiny-int8" --type yaml

Length of output: 184


Script:

#!/bin/bash
# Check for any references to turbo model in the codebase
rg -l "turbo" --type python

# Check for any documentation or examples mentioning turbo model
fd README.md | xargs rg -l "turbo"

# Look for model-related configuration handling code
ast-grep --pattern 'def $_(self, model: str):'

Length of output: 142


Script:

#!/bin/bash
# Search for any Python files in the repository
fd '\.py$'

# Search for any references to model configuration or validation
rg -l "model" --type yaml

# Look for any documentation files
fd -e md -e txt

Length of output: 2030


Script:

#!/bin/bash
# Check Whisper documentation for turbo model mentions
cat whisper/DOCS.md whisper/README.md whisper/CHANGELOG.md

# Look for model validation or handling in the configuration
cat whisper/config.yaml

# Check for any model-related translations
cat whisper/translations/en.yaml

Length of output: 8261

whisper/build.yaml Show resolved Hide resolved
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (1)
whisper/DOCS.md (1)

44-44: Consider reorganizing the model list and enhancing the turbo model description

The placement of turbo at the top of the list breaks the established pattern of ordering models by size/complexity. Additionally, the description "faster than large-v3" lacks important details about accuracy trade-offs.

Consider:

  1. Moving turbo to a more logical position in the list, possibly near the large models
  2. Expanding the description to include accuracy comparisons with large-v3
-Available models:
-
-- `turbo` (faster than `large-v3`)
-- `tiny-int8` (compressed)
+Available models:
+
+- `tiny-int8` (compressed)
...
+- `large-v3`
+- `turbo` (faster processing than `large-v3` with comparable accuracy)
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 92ce193 and 7e6e558.

📒 Files selected for processing (1)
  • whisper/DOCS.md (1 hunks)
🧰 Additional context used
📓 Path-based instructions (1)
whisper/DOCS.md (6)

Pattern */**(html|markdown|md): - For instructional content in documentation, use a direct and authoritative tone. Avoid expressions of politeness such as 'may' or 'please', and ensure the goal of the instruction is fronted.

  • Apply the Microsoft Style Guide to ensure documentation maintains clarity and conciseness.
  • In step-by-step instructions, front the location phrase in the instructional sentence.
  • In step-by-step instructions, front the 'goal' in the instructional sentence.
  • In step-by-step instructions, if in doubt what to front, front the 'goal' before the location phrase in the instructional sentence.
  • do not hyphenate terms like 'top-right' or 'bottom-left' with 'corner'

Pattern */**(html|markdown|md): - Use bold to mark UI strings.

  • If "" are used to mark UI strings, replace them by bold.

Pattern */**(html|markdown|md): - Be brief in your replies and don't add fluff like "thank you for..." and "Please let me know if"


Pattern */**(html|markdown|md): - Use sentence-style capitalization also in headings.


Pattern */**(html|markdown|md): do not comment on HTML used for icons


Pattern */**(html|markdown|md): Avoid flagging inline HTML for embedding videos in future reviews for this repository.

🔇 Additional comments (1)
whisper/DOCS.md (1)

44-44: Verify model list consistency with configuration

Let's ensure the documented models match the available options in the add-on configuration.

✅ Verification successful

Model list in documentation matches configuration options

The documented models in DOCS.md perfectly align with the available options in config.yaml. The schema in the configuration file includes all models listed in the documentation, including the turbo model. Both files maintain consistency in their model listings.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check if the model list in DOCS.md matches config.yaml
# Expected: The model options in config.yaml should match the documented list

# Get the model list from config.yaml
echo "Models in config.yaml:"
rg "model:" whisper/config.yaml -A 30

# Compare with documentation
echo -e "\nModels in DOCS.md:"
rg "^- \`.*\`" whisper/DOCS.md

Length of output: 1538

@synesthesiam
Copy link
Contributor

Upstream repo has been updated, so the tests can be run now.

@synesthesiam
Copy link
Contributor

Copy link
Member

@agners agners left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks!

@agners agners merged commit 5dd4f18 into home-assistant:master Dec 3, 2024
11 checks passed
@duhow duhow deleted the faster-whisper-update branch December 3, 2024 18:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants