New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Spare optimizations for GaBW sampling #331

Open

lucaperju wants to merge 3 commits into GeomScale:develop from lucaperju:gabw_optimizations

Contributor

lucaperju commented Sep 6, 2024

Samples faster when the A matrix of the polytope is sparse.
Very fast when both the A matrix of the polytope and the covariance matrix are sparse.

lucaperju added 2 commits

September 3, 2024 18:11


          simplify gabw sampling

ce1739a


          optimize gabw for sparse polytopes

19565dc

vfisikop reviewed

View reviewed changes

Contributor

vfisikop left a comment

Great work thanks!

include/convex_bodies/hpolytope.h

                   template <typename update_parameters>
                   auto compute_reflection(Point &v, Point &p, update_parameters const& params) const
-                       -> std::enable_if_t<std::is_same_v<MT, Eigen::SparseMatrix<NT, Eigen::RowMajor>> && !std::is_same_v<update_parameters, int>, void> { // MT must be in RowMajor format
+                       -> std::enable_if_t<std::is_same_v<MT, Eigen::SparseMatrix<NT, Eigen::RowMajor>> && !std::is_same_v<update_parameters, int>, void> {

Contributor

vfisikop Sep 13, 2024

What is update_parameters here? I do not understand !std::is_same_v<update_parameters could you please explain?

Contributor Author

lucaperju Sep 13, 2024

basically there's another compute_reflection function above which takes an integer (just the facet) as the 3rd argument, and for some reason, if I don't have that condition the compiler decides to call this function assuming that the typename of update_parameters is int. There might maybe be better ways of dealing with these issues, but I remember I tried to solve them for some time and this is the best I could do, I couldn't at all understand how the compiler chooses which function to use when there's multiple ones that match

include/random_walks/gaussian_accelerated_billiard_walk.hpp Outdated Show resolved Hide resolved

include/convex_bodies/hpolytope.h Outdated Show resolved Hide resolved

include/convex_bodies/hpolytope.h Outdated Show resolved Hide resolved

include/random_walks/gaussian_accelerated_billiard_walk.hpp Outdated

                       typedef typename Polytope::VT VT;
                       typedef typename Point::FT NT;
+                      using AA_type = std::conditional_t< std::is_same_v<MT, typename Eigen::SparseMatrix<NT, Eigen::RowMajor>>, typename Eigen::SparseMatrix<NT>, DenseMT >;

Contributor

vfisikop Sep 13, 2024

similar comment to AE_type, is there a better naming?

Contributor Author

lucaperju Sep 13, 2024

hmm, I'll think of one, I'm not really sure what name I could give it but I'll see if I can come up with a better name.

include/random_walks/gaussian_accelerated_billiard_walk.hpp Show resolved Hide resolved

include/random_walks/gaussian_accelerated_billiard_walk.hpp

+                          if constexpr (std::is_same<AA_type, Eigen::SparseMatrix<NT>>::value) {
+                              _AA = (P.get_mat() * P.get_mat().transpose());
+                          } else {
+                              _AA.noalias() = (DenseMT)(P.get_mat() * P.get_mat().transpose());

Contributor

vfisikop Sep 13, 2024

should we explicitly cast it to DenseMT?

Contributor Author

lucaperju Sep 13, 2024

I'm not sure what you mean, for the optimizations I need it to be in colmajor SparseMatrix format

include/random_walks/gaussian_accelerated_billiard_walk.hpp Outdated Show resolved Hide resolved

include/random_walks/gaussian_accelerated_billiard_walk.hpp

                       E_type _E;
                       VT _AEA;
                       unsigned int _rho;
                       update_parameters _update_parameters;
                       typename Point::Coeff _lambdas;
                       typename Point::Coeff _Av;
-                      bool was_reset;
+                      BoundaryOracleHeap<NT> _distances_set;

Contributor

vfisikop Sep 13, 2024

This is defined in uniform ABW, it should be better if it is defined in a separate file and both walks include it.

Contributor Author

lucaperju Sep 13, 2024

yeah, I was thinking about that too, any suggestions for that file name?

include/random_walks/gaussian_accelerated_billiard_walk.hpp

+                                      _distances_set.vec[i].first = ( *(b_data + i) - (*(Ar_data + i)) ) / (*(Av_data + i));
+                                  }
+                                  // rebuild the heap with the new values of (b - Ar) / Av
+                                  _distances_set.rebuild(_update_parameters.moved_dist);

Contributor

vfisikop Sep 13, 2024

Why not inserting the new values in the heap (in O(logn)) instead of rebuilding (in O(n))?

Contributor Author

lucaperju Sep 13, 2024 •

edited

Loading

here, this happens after we set a new direction, in which case now all the values have changed, so it's quicker to rebuild (O(n)) rather than insert each one (O(nlogn)). I'm not entirely sure if it makes a difference, but I think it does, since afterwards I never do O(nlogn) things, just O(non_zeroes * logn), so this O(nlogn) could be a bottleneck


          minor changes

4fc2b3f

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet