Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

index: data: support imports #9181

Merged
merged 1 commit into from
Mar 15, 2023
Merged

Conversation

efiop
Copy link
Contributor

@efiop efiop commented Mar 15, 2023

Stepping stone to simplifying dvc fetch/pull by using index.

Fetch handles regular imports through index already, but not repo imports because their processing is much more involved (e.g. chained imports) in the current arch.

With FileStorage support introduced into DataIndex and datafs supporting imports overall, dvcfs can now handle repo imports (even chained ones). This will soon allow us to handle repo imports the same way we handle regular ones, improve performance and get rid of a lot of messy code (e.g. DependencyRepo).

Related iterative/scmrepo#207
Related iterative/dvc-data#315
Related https://github.com/iterative/studio/issues/5261

@efiop efiop marked this pull request as draft March 15, 2023 03:06
@efiop efiop changed the title index: data: support imports [WIP] index: data: support imports Mar 15, 2023
@codecov
Copy link

codecov bot commented Mar 15, 2023

Codecov Report

Patch coverage: 100.00% and project coverage change: -0.25 ⚠️

Comparison is base (e6419b9) 92.91% compared to head (6927b3d) 92.66%.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #9181      +/-   ##
==========================================
- Coverage   92.91%   92.66%   -0.25%     
==========================================
  Files         456      456              
  Lines       36884    36886       +2     
  Branches     5324     5324              
==========================================
- Hits        34269    34181      -88     
- Misses       2091     2162      +71     
- Partials      524      543      +19     
Impacted Files Coverage Δ
dvc/dependency/repo.py 95.58% <100.00%> (+0.06%) ⬆️
dvc/repo/index.py 90.90% <100.00%> (ø)

... and 22 files with indirect coverage changes

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

Stepping stone to simplifying `dvc fetch/pull` by using index.

Fetch handles regular imports through index already, but not repo
imports because their processing is much more involved (e.g. chained
imports) in the current arch.

With `FileStorage` support introduced into `DataIndex` and `datafs`
supporting imports overall, `dvcfs` can now handle repo imports (even
chained ones). This will soon allow us to handle repo imports the same
way we handle regular ones, improve performance and get rid of a lot of
messy code (e.g. DependencyRepo).

Related iterative/scmrepo#207
Related iterative/dvc-data#315
Related https://github.com/iterative/studio/issues/5261
@efiop efiop force-pushed the fetch-import-index branch from eff2999 to 6927b3d Compare March 15, 2023 17:33
@efiop efiop changed the title [WIP] index: data: support imports index: data: support imports Mar 15, 2023
@efiop efiop marked this pull request as ready for review March 15, 2023 17:58
@efiop efiop merged commit 53f9953 into iterative:main Mar 15, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant