Caching translations implementation #202
Conversation
If we don't care too much about redoing some work, why not a direct mapped cache with atomic pointers?
src/translator/cache.h
// Limit of size (in bytes) of storage_
size_t storageSizeLimit_;

HashCacheKey hashFn_;
Again this is strange, as it holds no state.
This is now a static member function at the interface, with CacheKey and hash(CacheKey) as protected members, so anyone outside is forbidden from using these (Edit: removed this to avoid confusion), and the static member function should eliminate any problems with "state".
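A rough sketch of the shape being described, with illustrative names rather than the ones actually used in src/translator/cache.h:

#include <cstddef>

class TranslationCache {
protected:
  // Key construction and hashing are implementation details of the cache.
  struct CacheKey { /* fields the hash is computed over */ };
  static std::size_t hash(const CacheKey &key);  // stateless, so no hashFn_ member is needed
public:
  // The public lookup/store API builds CacheKeys internally and calls hash().
};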
I might have missed something, as this is 1.5k lines of code. Looks better; take a look at the comments.
src/translator/cache.cpp
cacheConfig_(config.sizeInMB * 1024 * 1024, std::chrono::seconds(config.timeToLiveInMilliseconds),
             config.removeExpired),
service_(epochManagerConfig_),
context_(service_.GetContext()),
I don't understand the lifecycle management of the L4 values you're reading well enough. Currently you have one read context_ that is used for all read operations across all threads. Supposedly context_ keeps things alive: https://github.com/Microsoft/L4/wiki/Epoch-Queue
Does that mean reads are just accumulating? What is decrementing the reference count?
If they're not just accumulating, then how can you be sure that the value is still there once other threads have done reads against the same context?
Does that mean reads are just accumulating? What is decrementing the reference count?
This is correct. They are accumulating. Fixes have been pushed; however, L4 is not lock-free on read anymore in this case.
- Buckets matter now. test_cache_hparam.sh http://ix.io/3AUP
- The multi/single (40/1) cache (on/off) 1M/100K experiment test_cache_overhead.sh http://ix.io/3AUQ
I'm confused, were the previous experiments run with 1 thread? This is more the result I would expect in a multithreaded solution, so perhaps a default of buckets = num_threads or num_threads/2?
were the previous experiments run with 1 thread?

Writes weren't flushed until destruction of Service -> Cache -> L4, so it was not a proper contention setting. L4 doesn't allow lock-free reads anymore; this is a limitation of the API around context_.
L4 allows a second reader to accept a context and read "multiple" times without locks, as demonstrated with multiple key-value pairs in https://github.com/browsermt/L4/blob/master/Examples/main.cpp, while something else writes, I suppose. For our use-case, our code is no longer "lock-free" (be it read or write).
I have missed this conversation. Why is the code no longer lock-free on read? Was what was done before wrong?
browsermt/L4#2 documents the life-cycle; we had leaks due to the read-context being kept alive for lock-free-ness. This would, however, clear up at the end, with the accounting not reflecting the same.
I manually added code in L4 to check whether the deferred deallocate actions are executed. They are not executed while context_ is held for lock-free reads.
74e890c (#202) is the fix, which should indicate what was done wrong.
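To make the life-cycle point concrete, here is a small stand-in sketch. It is not the L4 API and not the actual change in 74e890c; it only illustrates that a context pins the current epoch for as long as it lives, so a long-lived member context_ blocks deferred deallocations, while a per-read context releases them as soon as the lookup finishes.

#include <memory>

// Stand-in types, purely to illustrate the lifecycle point; this is not the L4 API.
// Deferred deallocations can only run once no context references the epoch any more.
struct Epoch {};
using Context = std::shared_ptr<Epoch>;

struct Service {
  Context GetContext() const { return epoch_; }  // every context pins the current epoch
  std::shared_ptr<Epoch> epoch_ = std::make_shared<Epoch>();
};

// What the cache did before: a member `Context context_;` held for the cache's
// whole lifetime, so the epoch reference never dropped and deferred deallocate
// actions never ran. Scoping the context to each read releases it promptly:
void readOnce(const Service &service) {
  Context context = service.GetContext();  // acquired for this lookup only
  // ... perform the lookup through the context ...
}  // context destroyed here, releasing the epoch so cleanup can proceed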
Here's some code for a cache. It probably has a worse eviction policy.

#include <cstddef>
#include <functional>
#include <memory>
#include <vector>

// Direct-mapped cache: one shared_ptr slot per bucket, overwritten on collision.
template <class Entry, class Hash = std::hash<Entry>, class Equals = std::equal_to<Entry> >
class SimpleCache {
public:
  explicit SimpleCache(std::size_t size) : entries_(size) {}

  // Returns the cached entry matching key, or an empty shared_ptr on a miss.
  template <class Key> std::shared_ptr<Entry> Find(const Key &key) const {
    const std::shared_ptr<Entry> &bucket = entries_[hash_(key) % entries_.size()];
    std::shared_ptr<Entry> ret =
#ifdef WASM
        bucket;  // single-threaded WASM build: a plain copy is enough
#else
        std::atomic_load(&bucket);
#endif
    if (ret && equals_(key, *ret)) {
      return ret;
    } else {
      return std::shared_ptr<Entry>();
    }
  }

  // Stores entry in its bucket, replacing whatever was there before.
  void Store(std::shared_ptr<Entry> entry) {
    std::shared_ptr<Entry> &bucket = entries_[hash_(*entry) % entries_.size()];
#ifdef WASM
    bucket = entry;
#else
    std::atomic_store(&bucket, entry);
#endif
  }

private:
  std::vector<std::shared_ptr<Entry> > entries_;
  Hash hash_;
  Equals equals_;
};
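Hypothetical usage, just to show the intended call pattern; with the default Hash/Equals arguments the entry doubles as its own key, so std::string works directly:

#include <memory>
#include <string>

int main() {
  SimpleCache<std::string> cache(1024);                      // 1024 direct-mapped buckets
  cache.Store(std::make_shared<std::string>("ein Satz"));
  auto hit = cache.Find(std::string("ein Satz"));            // non-empty shared_ptr on hit
  auto miss = cache.Find(std::string("something else"));     // empty on miss (or collision)
  return (hit && !miss) ? 0 : 1;
}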
Contiguous buckets will suffer some false sharing. I guess they could be spaced out to cache line size.
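A minimal sketch of that spacing, assuming the SimpleCache above and a 64-byte cache line; the PaddedBucket wrapper is illustrative, not code from this PR:

#include <memory>
#include <vector>

// Align each bucket to a full cache line so neighbouring buckets never share a
// line, avoiding false sharing between threads. Requires C++17 so the vector's
// default allocator honours the over-alignment.
template <class Entry>
struct alignas(64) PaddedBucket {
  std::shared_ptr<Entry> ptr;
};

// SimpleCache::entries_ would then become std::vector<PaddedBucket<Entry> >,
// with Find/Store loading and storing bucket.ptr instead of the bare slot.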
Closing this in favour of #227.
Fixes #201.