Network solvers redesign #242

emstoudenmire · 2025-07-02T16:10:42Z

This PR is a rewrite of codes for "sweep solver" algorithms such as DMRG, TDVP, etc.
It introduces a simplified designs for the whole system, especially regarding the creation of "region plans", how the double loop comprising what is now called sweep_solve is coded, and the handling of keyword arguments.

The PR also adds a subspace expansion system with one currently implementation (based on a "projected density matrix perturbation" idea, basically McCulloch+White hybrid method). Having a subspace expansion is crucial to converge DMRG on certain tree lattices, not to mention other cases like 2D DMRG with QN conservation.

Internally, the core design revolves around two iterators: a sweep iterator and a region iterator. Each of these is in principle wrappable by "iteration adapters" (see for example the tuple adapter in adapters.jl). At each iteration of the RegionIterator type, it calls a region_iterator_action function which can be overloaded, but by default calls three "subactions": extracter, updater, and inserter. Currently extracter also calls down into a subspace_expand function which can be customized through multiple backends. These "action" functions all dispatch on a "problem" type which can hold arbitrary data, making these codes more flexible and future-proof toward cases like optimizing two tensor network states at once, working with sets of operators, etc.

Other improvements that may fit better into future PRs:

install the "fitting" implementations of truncate and apply already in the NetworkSolvers repo (there were some issues bringing them over)
replace the use of ProjTTN with a BP cache
more friendly interfaces to methods like dmrg which automatically propagate truncation arguments into more "expert" keyword argument packs
helper functions for making keyword parameter packs and propagating defaults
discuss what is the best strategy to optionally truncate the bonds in 1-site TDVP, perhaps during the orthogonalize/gauge_walk part of the extracter step
improve the subspace expansion code to expand all bonds around the current region and simplify the current code (the first change may already help to simplify the code somewhat)

emstoudenmire · 2025-07-02T16:11:39Z

Fyi, At least one test was failing on my machine, but it didn't seem to be related at all to the code in this PR.

codecov · 2025-07-02T16:14:24Z

Codecov Report

Attention: Patch coverage is 0% with 478 lines in your changes missing coverage. Please review.

Project coverage is 0.00%. Comparing base (749fb78) to head (a5247f4).

Files with missing lines	Patch %	Lines
src/solvers/applyexp.jl	0.00%	49 Missing ⚠️
src/solvers/fitting.jl	0.00%	48 Missing ⚠️
src/solvers/subspace/densitymatrix.jl	0.00%	44 Missing ⚠️
src/solvers/iterators.jl	0.00%	42 Missing ⚠️
src/solvers/region_plans/euler_tour.jl	0.00%	37 Missing ⚠️
src/solvers/subspace/ortho_subspace.jl	0.00%	37 Missing ⚠️
src/solvers/eigsolve.jl	0.00%	29 Missing ⚠️
src/solvers/region_plans/tdvp_region_plans.jl	0.00%	29 Missing ⚠️
src/solvers/subspace.jl	0.00%	28 Missing ⚠️
src/solvers/inserter.jl	0.00%	18 Missing ⚠️
... and 12 more

Additional details and impacted files

@@          Coverage Diff           @@
##            main    #242    +/-   ##
======================================
  Coverage   0.00%   0.00%            
======================================
  Files         75      86    +11     
  Lines       3784    4042   +258     
======================================
- Misses      3784    4042   +258

Flag	Coverage Δ
docs	`0.00% <0.00%> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

emstoudenmire · 2025-07-02T16:16:35Z

Is the version check meaning I should just bump the version number? Which version number should I bump to?

mtfishman · 2025-07-02T16:21:53Z

Is the version check meaning I should just bump the version number? Which version number should I bump to?

The version check is checking whether or not you've bumped the package version here:

ITensorNetworks.jl/Project.toml

Line 4 in 2d7db49

version = "0.13.12"

. The reason for the check is that in general we are trying to enforce that we always bump the package version in every PR and register a new version after the PR is merged, so that versions don't drag out and accumulate too many changes, turnover time to release bug fixes and features for users is faster, etc. I'm sure this PR is breaking in some way so you should bump the version to v0.14.

src/solvers/applyexp.jl

src/solvers/inserter.jl

src/solvers/permute_indices.jl

src/solvers/eigsolve.jl

mtfishman · 2025-07-02T22:15:42Z

src/solvers/eigsolve.jl

+end
+
+eigenvalue(E::EigsolveProblem) = E.eigenvalue
+ITensorNetworks.state(E::EigsolveProblem) = E.state


Suggested change

ITensorNetworks.state(E::EigsolveProblem) = E.state

state(E::EigsolveProblem) = E.state

I was getting an error if I didn't prepend ITensorNetworks here, which is why I put it. I'll find out the exact error and maybe we can fix the issue more at the root of what's causing it.

src/solvers/eigsolve.jl

mtfishman · 2025-07-02T22:26:47Z

src/solvers/adapters.jl

+# generates each tuple?
+#
+
+mutable struct TupleRegionIterator{RegionIter}


Maybe we can come up with a name that is more descriptive, like RegionIteratorWithKwargs. Tuple is a bit vague.

mtfishman · 2025-07-02T22:28:41Z

src/solvers/applyexp.jl

+  current_time::Number = 0.0
+end
+
+ITensorNetworks.state(A::ApplyExpProblem) = A.state


Suggested change

ITensorNetworks.state(A::ApplyExpProblem) = A.state

state(A::ApplyExpProblem) = A.state

since we are in the ITensorNetworks module.

I'll look into what error I was getting when I didn't prepend ITensorNetworks. I agree it shouldn't be needed in principle.

mtfishman · 2025-07-02T22:30:55Z

src/solvers/iterators.jl

+
+  if !isnothing(which)
+    S.region_iter = region_iterator(
+      problem(S.region_iter); sweep=S.which_sweep, current_sweep_kws...


Suggested change

problem(S.region_iter); sweep=S.which_sweep, current_sweep_kws...

problem(S); sweep=S.which_sweep, current_sweep_kws...

I think, based on the definition of problem(::SweepIterator) above.

Also, this code pattern confused me a bit, what do you think of writing it like this:

update_region_iterator!(S; current_sweep_kws...)

and hiding the implementation in update_region_iterator!?

src/solvers/iterators.jl

mtfishman · 2025-07-02T22:50:22Z

src/solvers/iterators.jl

+mutable struct SweepIterator
+  sweep_kws
+  region_iter
+  which_sweep::Int
+end


What about parameterizing this struct by the types of the fields sweep_kws and region_iter? Is it meant to be dynamic, i.e. is it expected that those types might change?

It seems best to have a goal of having those be concrete and not changing type, but if they do you can still parameterize the type and then set the type parameter to an abstract type when it is being constructed (as needed), for example you can explicitly construct it as SweepIterator{Any,Any}(sweep_kws, region_iter, which_sweep).

mtfishman · 2025-07-02T22:58:54Z

src/solvers/iterators.jl

+
+problem(R::RegionIterator) = R.problem
+current_region_plan(R::RegionIterator) = R.region_plan[R.which_region]
+current_region(R::RegionIterator) = current_region_plan(R)[1]


What's the data structure for the output of current_region_plan(R)? Something doesn't feel right to me that the region is accessed with indexing by 1, maybe it should be a NamedTuple and the region could be accessed as current_region_plan(R).region or a struct where it is accessed with a function get_region(current_region_plan(R))?

mtfishman · 2025-07-02T23:00:07Z

src/solvers/iterators.jl

+current_region(R::RegionIterator) = current_region_plan(R)[1]
+region_kwargs(R::RegionIterator) = current_region_plan(R)[2]
+function previous_region(R::RegionIterator)
+  R.which_region==1 ? nothing : R.region_plan[R.which_region - 1][1]


It seems like maybe we should be using function accessors rather than field accessors, i.e. which_region(R) instead of R.which_region.

JoeyT1994 · 2025-07-04T17:28:41Z

@emstoudenmire the reason for the failing test is that the apply(t1::ttn, t2::ttn; alg = "fit") is no longer in the code (it was in the old solvers code) but there is a test test/test_ttn_contract.jl that uses it. It is now passing to the apply() function in src/apply which should be made more type restricted, i.e. we should ust change apply(o, tn::AbstractITensorNetwork) to apply(o::Union{NamedEdge, ITensors}, tn::AbstractITensorNetwork) and reenable the test once we are settled on the apply interface with the new solver (which can handle this case).

mtfishman · 2025-07-04T21:20:35Z

src/solvers/iterators.jl

+    )
+  end
+  S.which_sweep += 1
+  return S.region_iter, next


I'm a bit on the fence about this, but I would lean towards a design where iterate(::SweepIterator) actually performs the region iteration as opposed to returning the region iterator, so then for _ in sweep_iterator end actually runs the calculation. Then, we could have an iteration adapter such as region_iterators(::SweepIterator) returns an iterator that returns the region iterator at each iteration.

An interesting question is then, in that alternative design, what iterate(::SweepIterator) should output at each iteration. The DifferentialEquations.jl design is that it outputs the iterator itself, i.e. if you call:

for x in sweep_iterator end

x will be the latest updated sweep_iterator, which I think makes sense since then you can access the information of the sweep_iterator inside the loop.

For reference, in DifferentialEquations.jl, iterate is just defined as:

function Base.iterate(integrator::DEIntegrator, state = 0) done(integrator) && return nothing state += 1 step!(integrator) return integrator, state end

which seems like a reasonable goal to aim for, and then the complexity of the implementation is in done(...) and step!(...).

mtfishman · 2025-07-04T21:51:49Z

src/solvers/iterators.jl

+
+mutable struct SweepIterator
+  sweep_kws
+  region_iter


Maybe this should be called current_region_iter or something like that to indicate it is the latest region iterator.

I have to say this part confused me a bit. I see the region iterator is recreated from scratch from the sweep_kws at each sweep, is there a reason to store it? It seems like it could just be made and used "on the fly" at each sweep anyway. Instead it seems like we could just store the problem in the SweepIterator.

Yes, keeping the problem in the sweep iterator could simplify the mechanics quite a lot. I was thinking originally like "better to have fewer fields in SweepIterator, especially since the region iterator will have a copy of the problem" but I think on balance it's best not to have complicated code handling creating each region iterator.

mtfishman · 2025-07-04T22:21:44Z

src/solvers/iterators.jl

+#
+
+mutable struct SweepIterator
+  sweep_kws


Maybe this could be named each_sweep_kwargs or sweep_kwargs_iterator to indicate that it itself is an iterator that returns the keyword arguments for the sweep at each iteration (rather than just the keyword arguments of the latest sweep).

Better handling of zero and vector creation Co-authored-by: Matt Fishman <[email protected]>

… names.

…etworks.jl into network_solvers

mtfishman · 2025-07-07T21:20:28Z

@emstoudenmire it is a bit annoying, but the way we have the packages set up now (which is kind of a necessary evil but could be simplified in the future), when you make a breaking version change you also have to update the compat entry of the package (in this case ITensorNetworks) in the subdirectory Project.toml files:

ITensorNetworks.jl/test/Project.toml

Line 47 in ba9c1ef

ITensorNetworks = "0.13.0"

,

ITensorNetworks.jl/examples/Project.toml

Line 5 in ba9c1ef

ITensorNetworks = "0.13.2"

,

ITensorNetworks.jl/docs/Project.toml

Line 8 in ba9c1ef

ITensorNetworks = "0.13.0"

.

emstoudenmire · 2025-07-08T17:58:41Z

Thanks, Matt. I'll make those Project.toml changes.

(By the way, I may be pushing some commits to this PR but in a way that doesn't signal that all of the review-related changes have been implemented yet. The reason is that I'm sharing this branch with Jason K. as a backend for some TDVP experiments he's helping me with.)

…e generic and correct exponent terminology.

for more information, see https://pre-commit.ci

mtfishman · 2025-07-15T02:16:43Z

src/solvers/applyexp.jl

+operator(A::ApplyExpProblem) = A.operator
+current_exponent(A::ApplyExpProblem) = A.current_exponent
+function current_time(A::ApplyExpProblem)
+  t = im*A.current_exponent


Suggested change

t = im*A.current_exponent

t = im * current_exponent(A)

Just a style change.

mtfishman · 2025-07-15T02:20:14Z

src/solvers/applyexp.jl

+current_exponent(A::ApplyExpProblem) = A.current_exponent
+function current_time(A::ApplyExpProblem)
+  t = im*A.current_exponent
+  return iszero(imag(t)) ? real(t) : t


I'm not sure how I feel about introducing this type instability, let's discuss this. I would lean towards a design where this code is not so clever and just returns im * current_exponent(A), and we provide a different interface for users to be more explicit about the types.

Different interfaces for each case is a good suggestion. How about current_time which returns just the real part of im*A.current_exponent (so usual "physics time") and also current_complex_time which returns a time (im*exponent) but as a complex value, in the sense of "complex time evolution" methods.

That sounds good. Maybe current_time should check that iszero(imag(t)) and if not error, silently ignoring the imaginary part may be confusing.

mtfishman · 2025-07-15T02:21:27Z

src/solvers/applyexp.jl

+
+function sweep_printer(
+  problem::ApplyExpProblem;
+  exponent_description="exponent",


This input is a bit odd to me, let's discuss this.

mtfishman · 2025-07-15T02:22:23Z

src/solvers/align_indices.jl

@@ -0,0 +1,16 @@
+
+function align_indices(tn)


Suggested change

function align_indices(tn)

function align_inds(tn)

since in general ITensor indices are referred to as inds in function names.

JoeyT1994 · 2025-07-16T13:55:26Z

src/apply.jl

A similar change to the function function ITensors.apply( o, ψ::VidalITensorNetwork; ...) lower down (lime 322) will be necessary to avoid ambiguity and the current failing test.

emstoudenmire added 9 commits July 1, 2025 15:37

Delete and move some files to make space for new code

eef7d0f

Add redesigned solver codes

b15d045

Update solvers codes to latest versions

3edcc3d

Remove previous solver includes

8930d80

Add ConstructionBase dependency

9235349

Adapt new solver codes into ITensorNetworks module

d70bf03

Continue adapting code and improve DMRG test

c081dfb

Rename tdvp to time_evolve. Add tests.

e5047fc

Change default outputlevel in tests

3f7442f