Thanks for this!
I adapted your performance script to do a quick-and-dirty check: swapping the ctx for the lru cache and running two tests, loop(write+read) and write+loop(read). A sketch of what I ran is below.
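This is a minimal sketch of the comparison, not the exact script: it assumes it runs in a `content_by_lua_block` and that nginx.conf declares a shared dict via `lua_shared_dict my_cache 10m;` (the names `my_cache`, `bench`, and the iteration count `N` are mine).

```lua
local lrucache = require "resty.lrucache"

local N = 100000  -- iteration count, pick whatever gives stable numbers

-- crude wall-clock timer; ngx.now() is cached, so force a refresh first
local function bench(name, fn)
    ngx.update_time()
    local start = ngx.now()
    fn()
    ngx.update_time()
    ngx.say(name, ": ", ngx.now() - start, "s")
end

local lru = assert(lrucache.new(N))
local dict = ngx.shared.my_cache

-- test 1: loop(write+read), interleaved in a single loop
bench("lru  write+read", function()
    for i = 1, N do
        local key = "k" .. i
        lru:set(key, i)
        local _ = lru:get(key)
    end
end)

bench("dict write+read", function()
    for i = 1, N do
        local key = "k" .. i
        dict:set(key, i)
        local _ = dict:get(key)
    end
end)

-- test 2: write+loop(read), all writes first, then all reads
bench("lru  write, then read", function()
    for i = 1, N do lru:set("k" .. i, i) end
    for i = 1, N do local _ = lru:get("k" .. i) end
end)

bench("dict write, then read", function()
    for i = 1, N do dict:set("k" .. i, i) end
    for i = 1, N do local _ = dict:get("k" .. i) end
end)
```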
On both operations the overhead is small... but the per-worker LRU cache is faster than the shared dict by a factor of around 10x.
This sounds in line with what I remember reading earlier. I needed to pull up some actual stats though, and this quick-and-dirty test suite does that.
We have some code in production that stores SSL certificates as cdata in the lru cache and fails over to PEM data stored in the shared dict (and, failing that, falls back to fetching the data from upstream); roughly the pattern sketched below. A friend is dealing with some caching performance issues during high-traffic periods and is using the shared dict. I thought enabling the LRU cache might help a bit.
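For illustration, here is the shape of that two-tier pattern, not our actual production code. It assumes a shared dict `certs` holding PEM strings and made-up names (`cert_cache`, `get_cert`, `cert_key`); the point is that the cdata returned by `ssl.parse_pem_cert()` can live in the per-worker lrucache but cannot go into a shared dict, which only stores strings, numbers, booleans, and nil.

```lua
local ssl = require "ngx.ssl"
local lrucache = require "resty.lrucache"

-- module-level so the cache persists across requests within a worker
local cert_cache = assert(lrucache.new(100))

local function get_cert(cert_key)
    -- fast path: parsed cdata already cached in this worker
    local cert = cert_cache:get(cert_key)
    if cert then
        return cert
    end

    -- fail over to the shared dict, which holds the raw PEM text
    local pem = ngx.shared.certs:get(cert_key)
    if not pem then
        return nil, "cert not found"  -- caller would fetch from upstream here
    end

    -- parse the PEM into cdata and cache it for this worker
    local err
    cert, err = ssl.parse_pem_cert(pem)
    if not cert then
        return nil, err
    end

    cert_cache:set(cert_key, cert)
    return cert
end
```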