Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Chore(linting): Add ruff lint rules #22

Closed
wants to merge 5 commits into from

Conversation

aaronsteers
Copy link
Contributor

@aaronsteers aaronsteers commented Nov 11, 2024

Summary by CodeRabbit

  • New Features

    • Enhanced dependency management and linting configurations.
    • Introduced new sections for improved linting rules and behaviors.
  • Improvements

    • Configured import sorting and complexity thresholds for better code quality.
    • Adjusted linting settings to relax certain limits and improve usability.

Copy link
Contributor

coderabbitai bot commented Nov 11, 2024

📝 Walkthrough
📝 Walkthrough

Walkthrough

The pyproject.toml file has undergone significant updates to improve dependency management and linting configurations. Key additions include a new [tool.ruff] section for linting, specifying included files and a preview option. Multiple subsections under [tool.ruff.lint] have been introduced, detailing various linting rules and configurations, including adjustments for pylint, isort, mccabe, pycodestyle, and pydocstyle. These changes enhance the organization of the file and clarify linting and formatting tasks.

Changes

File Change Summary
pyproject.toml - Added [tool.ruff] section with include and preview options.
- Introduced [tool.ruff.lint] section with comprehensive linting rules.
- Added [tool.ruff.lint.pylint] section with relaxed limits for max-args and max-branches.
- Created [tool.ruff.lint.isort] section for import sorting configurations.
- Included [tool.ruff.lint.mccabe] section to set max-complexity to 24.
- Configured [tool.ruff.lint.pycodestyle] to ignore overlong task comments.
- Set [tool.ruff.lint.pydocstyle] convention to "google".
- Added [tool.ruff.lint.flake8-annotations] section for type annotation configurations.

Possibly related PRs

Wdyt about considering any additional linting rules or configurations that might further improve the code quality?


Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

‼️ IMPORTANT
Auto-reply has been disabled for this repository in the CodeRabbit settings. The CodeRabbit bot will not respond to your replies unless it is explicitly tagged.

  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR. (Beta)
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

@aaronsteers aaronsteers changed the title Chore (linting): Add ruff lint rules Chore(linting): Add ruff lint rules Nov 11, 2024
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (3)
pyproject.toml (3)

166-215: Consider enabling some ignored rules in the future

While the ignore list is well-documented, consider enabling these rules in the future to improve code quality:

  • D102, D103: Enforcing docstrings would improve code documentation
  • E501: Since line length is set to 100, this rule could help maintain consistent line lengths

248-250: Consider reducing complexity thresholds

The current McCabe complexity threshold of 24 is quite high and might allow overly complex functions. Consider:

  1. Reducing it to a more standard value (10-15)
  2. Gradually decreasing it over time to encourage simpler functions

225-228: Review relaxed Pylint thresholds

The relaxed thresholds for max-args (8) and max-branches (15) might lead to functions that are harder to maintain. Consider:

  1. Keeping max-args but adding a comment explaining why 8 is acceptable
  2. Reducing max-branches to align with cognitive complexity best practices
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 734d778 and 2a2f543.

📒 Files selected for processing (2)
  • pyproject.toml (1 hunks)
  • pyproject.toml-bak (0 hunks)
💤 Files with no reviewable changes (1)
  • pyproject.toml-bak
🔇 Additional comments (3)
pyproject.toml (3)

110-110: Consider the stability implications of enabling preview mode

Preview mode enables features that are not yet stable and may introduce breaking changes in future Ruff updates. Consider disabling it if stability is a priority.


113-165: Well-structured and comprehensive rule selection

The selected rules provide excellent coverage across various aspects of code quality:

  • Security rules (B, BLE)
  • Performance rules (PERF)
  • Code quality rules (E, F)
  • Style rules (I, Q)

Each rule is well-documented with references to the official documentation.


216-222: Good safety considerations in fix configurations

The unfixable rules are well-chosen to prevent accidental data loss and maintain code safety:

  • Preserving commented code (ERA001)
  • Preserving print statements (T201)
  • Preventing unsafe type checking modifications (TCH001, TCH002)

@github-actions github-actions bot added the chore label Nov 12, 2024
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (3)
pyproject.toml (3)

118-171: Consider potential rule conflicts

You've selected a comprehensive set of rules! 🎉 However, I noticed some rules that might conflict with each other or be too strict initially:

  • FA (future-annotations) with ANN (annotations) might have overlapping concerns
  • FBT (boolean-trap) might be too strict for existing codebase

Would it make sense to start with a smaller subset and gradually enable more rules as the codebase adapts? This could help with adoption and reduce initial friction. wdyt?


172-221: Consider documenting rationale for ignored rules

Love how you've organized the ignore list into categories! 🌟 For some rules like PLR6201 and SIM910, you've added clear explanations. Would it be helpful to add similar rationales for other ignored rules? This could help future maintainers understand why certain decisions were made.

For example:

  • Why is D (pydocstyle) ignored when we're using Google convention?
  • Is S (bandit) something we want to enable later for security?

wdyt?


222-228: Review unfixable rules list

Great job protecting against unsafe changes! 🛡️ I noticed TCH001 and TCH002 are marked as unfixable due to Pydantic and Serpyco models. Since we're using these frameworks extensively, this makes sense.

However, would it be worth reconsidering ERA001 (commented-out code)? While I understand the caution, commented-out code often indicates technical debt. Maybe we could enable fixing it in a separate cleanup PR? wdyt?

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 2a2f543 and 6460550.

📒 Files selected for processing (1)
  • pyproject.toml (1 hunks)
🔇 Additional comments (1)
pyproject.toml (1)

231-273: Document rationale for relaxed limits

I see you've relaxed some limits:

  • max-args = 8 (up from 5)
  • max-branches = 15 (up from 12)
  • max-complexity = 24

These values are quite permissive. Would it be helpful to add comments explaining why these specific values were chosen? This could help prevent them from being accidentally tightened in the future. Also, should we consider gradually reducing these limits over time? wdyt?

Comment on lines +112 to +117
include = ["airbyte_cdk/**/*.py"] # Ignore test and script folders

target-version = "py310"
line-length = 100
preview = true

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codebase verification

Preview mode is already disabled in the codebase

Hey! I noticed that while this PR is adding preview = true, there's actually a conflicting setting elsewhere in the codebase that sets preview = false. Since preview features can indeed introduce breaking changes, would it be better to keep it disabled for now? This would help maintain stability across ruff updates. wdyt?

🔗 Analysis chain

Consider the implications of enabling preview mode

Hey! I noticed you've enabled preview mode for ruff. While this gives us access to the latest features, it might introduce breaking changes in future updates. What do you think about adding a comment explaining why we need preview mode? This would help future maintainers understand the decision. wdyt?

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check if any preview features are actually being used
rg -l "preview.*=.*true" . | xargs ruff --show-settings 2>&1 | grep -i "preview"

Length of output: 75


Script:

#!/bin/bash
# Let's check what ruff rules are currently enabled and if any preview features are mentioned
rg -A 5 -B 5 "\[tool\.ruff\]" pyproject.toml

# Also check if there are any comments mentioning preview features
rg -i "preview" pyproject.toml

Length of output: 586

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (4)
pyproject.toml (4)

122-122: Consider using a more inclusive pattern for Python files

The current pattern "airbyte_cdk/**/*.py" looks good but might miss Python files in other directories. Would you consider using ["**/*.py", "**/*.pyi"] to catch all Python files and type stubs, and then use exclude for specific directories you want to ignore? wdyt?


129-181: Great selection of rules! Consider phasing them in gradually

The rule selection is comprehensive and well-documented! However, enabling all these rules at once might be overwhelming. Would you consider:

  1. Adding comments for each rule category explaining why it was chosen?
  2. Implementing rules in phases, starting with the most critical ones?

182-231: Excellent documentation of ignored rules!

Love how you've documented the rationale for ignored rules and separated them into "fix later" vs "won't fix"! This will be super helpful for maintenance. One suggestion: would you consider adding GitHub issues for the rules marked with "Consider re-enabling these when we have time"? This would help track technical debt. wdyt?


241-244: Consider documenting the rationale for relaxed limits

The relaxed limits for max-args and max-branches make sense, but would you consider adding comments explaining why these specific values were chosen? This would help future maintainers understand if these limits need adjustment. wdyt?

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 6460550 and 5d3dc2e.

📒 Files selected for processing (1)
  • pyproject.toml (1 hunks)
🔇 Additional comments (3)
pyproject.toml (3)

126-126: Consider the implications of enabling preview mode

Hey! I noticed you've enabled preview mode for ruff. While this gives us access to the latest features, it might introduce breaking changes in future updates. What do you think about adding a comment explaining why we need preview mode? This would help future maintainers understand the decision. wdyt?

#!/bin/bash
# Check if any preview features are actually being used
rg -l "preview.*=.*true" . | xargs ruff --show-settings 2>&1 | grep -i "preview"

232-238: LGTM! Safe configuration for auto-fixes

Great choice on marking code comments and print statements as unfixable to prevent accidental loss of important information!


264-265: Verify if the complexity threshold is appropriate

The McCabe complexity threshold of 24 is quite high compared to the typical recommendation of 10-15. Would you consider:

  1. Adding a comment explaining why this threshold was chosen?
  2. Running some analysis to see what the current complexity distribution looks like in the codebase?
✅ Verification successful

Based on the analysis of the codebase, I can provide a final response:

The McCabe complexity threshold of 24 seems reasonable for this codebase

Looking at the test files and implementation code, there are legitimate cases of higher complexity, particularly in:

  • Test scenarios with multiple nested conditions (e.g., test_client_side_record_filter_decorator_with_cursor_types)
  • Stream handling with various state management scenarios
  • Error handling and retry logic

The threshold of 24 appears to be a pragmatic choice that balances code maintainability with the inherent complexity of implementing streaming data connectors. Would you consider adding a comment like this to explain the rationale? wdyt?

[tool.ruff.lint.mccabe]
# Higher threshold to accommodate complex stream handling, state management,
# and comprehensive test scenarios typical in data connector implementations
max-complexity = 24
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check current complexity distribution
rg -l "def " | xargs ruff check --select C90 --statistics 2>&1 | sort -n

Length of output: 108


Script:

#!/bin/bash
# Find Python files with high cyclomatic complexity using ast-grep
# Look for functions with many conditional branches, loops, and exception handlers
ast-grep --pattern 'def $_ ($$$):
  $$$
  if $_:
    $$$
  if $_:
    $$$
  if $_:
    $$$
  $$$'

# Also search for functions with many elif branches
ast-grep --pattern 'def $_ ($$$):
  $$$
  if $_:
    $$$
  elif $_:
    $$$
  elif $_:
    $$$
  $$$'

# Search for functions with nested loops
ast-grep --pattern 'def $_ ($$$):
  $$$
  for $_ in $_:
    $$$
    for $_ in $_:
      $$$'

# Get a list of all Python files for manual inspection if needed
fd -e py

Length of output: 60867

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (5)
pyproject.toml (5)

120-121: Consider using a more specific include pattern

The current pattern includes all Python files in airbyte_cdk. Would it be more precise to use ["airbyte_cdk/**/*.py", "airbyte_cdk/*.py"] to explicitly catch both root and nested files? wdyt?


127-179: Consider a phased rollout of lint rules

You've selected a comprehensive set of rules, which is great! However, would it make sense to introduce these in phases? We could start with essential rules (E, F, W) and gradually enable others as the codebase adapts. This could help with adoption and reduce initial friction. wdyt?


180-230: Consider enabling security-related rules

I notice that security-related rules (S) are ignored. While they can be noisy, would it be worth enabling specific ones like S101 (assert statements) or S106 (hardcoded passwords)? This could help catch potential security issues early. wdyt?


239-242: Consider documenting the rationale for relaxed limits

The increased limits for max-args (8) and max-branches (15) seem reasonable, but would it be helpful to add a comment explaining why these specific values were chosen? This could help future maintainers understand the decision. wdyt?


275-281: Inconsistent preview settings between format and lint

I noticed that format.preview is set to false while lint.preview is true. Should we align these settings to maintain consistency? wdyt?

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5d3dc2e and 90d1d8b.

📒 Files selected for processing (1)
  • pyproject.toml (1 hunks)
🔇 Additional comments (2)
pyproject.toml (2)

124-125: Preview mode is already disabled in the codebase

Hey! I noticed that while this PR is adding preview = true, there's actually a conflicting setting elsewhere in the codebase that sets preview = false. Since preview features can indeed introduce breaking changes, would it be better to keep it disabled for now? This would help maintain stability across ruff updates. wdyt?


230-238: LGTM! Good safeguards against unsafe fixes

The fixable/unfixable configuration looks well-balanced, especially protecting against silent code loss and unsafe type checking moves.

@natikgadzhi
Copy link
Contributor

@aaronsteers what's blocking you from merging it?

@aaronsteers
Copy link
Contributor Author

@natikgadzhi - This one is stale and didn't fix the issues. I've opened a new pair which can replace this:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants