Optimize costs #1149

Open · snarfed opened this issue Jun 24, 2024 · 56 comments

snarfed commented Jun 24, 2024

Don't want to draw attention to this, I've been looking at it mostly behind the scenes, but I'd like to start tracking at least the investigation and work more publicly.

I expect there's plenty of low hanging fruit here. Biggest contributors right now are datastore reads and frontend instances, both of which I should be able to cut down. Biggest blocker right now is that I'm not sure what's driving the datastore read load, esp since I added a memcached instance a while back. Hrm.

[billing cost charts: month and year views]

snarfed added the 'now' label Jun 25, 2024

snarfed commented Jun 26, 2024

related: #1152 (but I expect outbox is a small fraction of overall cost)


snarfed commented Jun 26, 2024

Looking at https://cloud.google.com/firestore/docs/audit-logging . I've turned on audit logs for Firestore/Datastore API and Access Approval in https://console.cloud.google.com/iam-admin/audit?project=bridgy-federated .


snarfed commented Jun 28, 2024

Got ~20h of datastore logs; digging into them in Log Analytics now. https://console.cloud.google.com/logs/analytics


snarfed commented Jun 28, 2024

...maybe promising? Eg this query breaks down by API method:

SELECT DISTINCT
  proto_payload.audit_log.method_name as method_name,
  count(*) as count FROM `bridgy-federated.global._Default._Default`
where log_name='projects/bridgy-federated/logs/cloudaudit.googleapis.com%2Fdata_access'
group by method_name
order by count desc
LIMIT 1000

and this one samples the actual contents of queries:

SELECT proto_payload
FROM `bridgy-federated.global._Default._Default`
where log_name='projects/bridgy-federated/logs/cloudaudit.googleapis.com%2Fdata_access'
  and proto_payload.audit_log.method_name='google.datastore.v1.Datastore.RunQuery'
limit 200

I can't aggregate (group by) fields inside proto_payload, though; I get "Grouping by expressions of type JSON is not allowed." Next step: try BigQuery to see if it can get around that.


snarfed commented Jun 28, 2024

It's in BigQuery, https://console.cloud.google.com/bigquery?ws=!1m5!1m4!4m3!1sbridgy-federated!2slogs!3s_Default, let's see how that goes. 37M rows total.


snarfed commented Jun 28, 2024

Damn, BigQuery can't do it either. Maybe if I pull the JSON out in a view.


snarfed commented Jul 25, 2024

Back to looking at this.


snarfed commented Jul 30, 2024

Some datastore API usage analytics below, which are already turning up useful results. Queries are ~40% of datastore API calls, and the vast majority of those are looking up original Objects and users for a given copy.

Query
SELECT
  DISTINCT proto_payload.audit_log.method_name as method_name, count(*) as count
FROM bridgy-federated.logs._AllLogs
where log_name='projects/bridgy-federated/logs/cloudaudit.googleapis.com%2Fdata_access'
  group by method_name
  order by count desc
method_name count
Lookup 4374961
RunQuery 3596350
Commit 548773
BeginTx 353552
Rollback 23279
...
Query
SELECT
  string(proto_payload.audit_log.request.query.kind[0].name) as kind,
  ARRAY_LENGTH(JSON_QUERY_ARRAY(proto_payload.audit_log.request.query.filter.compositeFilter.filters)) as num_composite_filters,
  count(*)
FROM bridgy-federated.logs._Default
where log_name='projects/bridgy-federated/logs/cloudaudit.googleapis.com%2Fdata_access'
  and proto_payload.audit_log.method_name='google.datastore.v1.Datastore.RunQuery'
group by kind, num_composite_filters
order by count(*) desc
kind | num_composite_filters | count
Object | | 767016
ATProto | | 764103
MagicKey | | 755585
ActivityPub | | 754627
UIProtocol | | 468614
Follower | 2 | 49023
AtpBlock | | 17479
AtpRemoteBlob | | 6890
AtpRepo | | 3566
Follower | | 3147
AtpRepo | | 2
ATProto | | 2
...
Query
SELECT
  string(proto_payload.audit_log.request.query.kind[0].name) as kind,
  FilterStr(proto_payload.audit_log.request.query.filter) as filter,
  count(*)
FROM bridgy-federated.logs._Default
where log_name='projects/bridgy-federated/logs/cloudaudit.googleapis.com%2Fdata_access'
  and proto_payload.audit_log.method_name='google.datastore.v1.Datastore.RunQuery'
  and proto_payload.audit_log.request.query.filter.propertyFilter is not null
group by kind, filter
order by count(*) desc
kind filter count
ActivityPub copies.uri EQUAL 753344
MagicKey copies.uri EQUAL 753290
Object copies.uri EQUAL 753245
ATProto copies.uri EQUAL 753243
UIProtocol copies.uri EQUAL 468614
AtpBlock seq GREATER_THAN_OR_EQUAL 17479
Object users EQUAL 12535
AtpRemoteBlob cid EQUAL 6890
ATProto handle EQUAL 6516
MagicKey manual_opt_out EQ 2295
Follower from EQUAL 1575
Follower to EQUAL 1572
ATProto enabled_protocols NOT_EQUAL 1232
ActivityPub enabled_protocols NOT_EQUAL 1231
...


snarfed commented Jul 30, 2024


snarfed commented Jul 30, 2024

Now looking at lookups aka gets. Surprising conclusion: the vast majority are for stored DID docs. Who knew.

Query
SELECT
  array_length(JSON_QUERY_ARRAY(proto_payload.audit_log.request.keys)) as num_keys,
  count(*)
FROM bridgy-federated.logs._Default
where log_name='projects/bridgy-federated/logs/cloudaudit.googleapis.com%2Fdata_access'
  and proto_payload.audit_log.method_name='google.datastore.v1.Datastore.Lookup'
group by num_keys
order by count(*) desc
num_keys count
1 4371422
2 999
3 519
4 365
5 229
6 171
7 101
8 83
9 74
100 68
12 67
...
Query
SELECT
  string(proto_payload.audit_log.request.keys[0].path[0].kind) as kind,
  count(*)
FROM bridgy-federated.logs._Default
where log_name='projects/bridgy-federated/logs/cloudaudit.googleapis.com%2Fdata_access'
  and proto_payload.audit_log.method_name='google.datastore.v1.Datastore.Lookup'
  and array_length(JSON_QUERY_ARRAY(proto_payload.audit_log.request.keys)) = 1
group by kind
order by count(*) desc
kind count
Object 4080965
ATProto 112141
AtpBlock 97120
MagicKey 38829
ActivityPub 29996
AtpRepo 7222
AtpSequence 3551
AtpRemoteBlob 1574
Cursor 24

Object lookups by id scheme:

Query
SELECT
  split(JSON_EXTRACT_SCALAR(proto_payload.audit_log.request.keys[0].path[0].name), ':')[0] as scheme,
  count(*)
FROM bridgy-federated.logs._Default
where log_name='projects/bridgy-federated/logs/cloudaudit.googleapis.com%2Fdata_access'
  and proto_payload.audit_log.method_name='google.datastore.v1.Datastore.Lookup'
  and string(proto_payload.audit_log.request.keys[0].path[0].kind) = 'Object'
group by scheme
order by count(*) desc
scheme count
did 3434021
at 413000
https 235446
http 85
...


snarfed commented Jul 31, 2024

I don't understand why the DID Object lookups aren't using memcache. The ndb contexts in all three services seem to have it configured right. Here are the top DIDs by number of lookups, in just a ~9h window. Almost all of them should have been cached. Why weren't they?

Query
SELECT
  JSON_EXTRACT_SCALAR(proto_payload.audit_log.request.keys[0].path[0].name) as name,
  count(*)
FROM bridgy-federated.logs._Default
where log_name='projects/bridgy-federated/logs/cloudaudit.googleapis.com%2Fdata_access'
  and proto_payload.audit_log.method_name='google.datastore.v1.Datastore.Lookup'
  and string(proto_payload.audit_log.request.keys[0].path[0].kind) = 'Object'
  and split(JSON_EXTRACT_SCALAR(proto_payload.audit_log.request.keys[0].path[0].name), ':')[0] = 'did'
group by name
order by count(*) desc
limit 100
name count
did:plc:ayutykgvyf4x7ev5ornltyzz 176211
did:plc:4mjwxpnhoeaknxqabwhf2n6i 64569
did:plc:p2ygpwluon3vrk5yecjq7wc5 56364
did:plc:hm5cxb2g2q4ma4ucsks73tex 53051
did:plc:b5glolrcdfnaxwc4dbe4zbty 43557
did:plc:3rxcq5wacdop5thjoh3sny3p 35103
did:plc:6zfgmvghpidjvcn3cqtitnx5 28309
did:plc:5syae7p7dr6zsfdoe4k6buky 26882
did:plc:dqgdfku26vpxdktkzne5x2xj 26240
did:plc:sese4fb6luojywziva3s7zjo 24876
...

snarfed added a commit that referenced this issue Aug 1, 2024

snarfed commented Aug 1, 2024

Progress here! Managed to cut datastore lookups and queries both way down, 5-10x each.

image image

snarfed added a commit to snarfed/webutil that referenced this issue Aug 2, 2024
snarfed added a commit that referenced this issue Aug 2, 2024
snarfed added a commit that referenced this issue Aug 2, 2024

snarfed commented Aug 2, 2024

Now looking at optimizing log storage. We were doing 250GB/mo a bit ago; we're now down to ~150GB/mo or so.

The main cost here is initial ingest: we get 50GB/mo for free, then $0.50/GB after that. That includes 30d of retention, and our current retention period is set to 30d, so reducing retention wouldn't help. https://cloud.google.com/stackdriver/pricing#cloud-monitoring-pricing

Tips on optimizing logging costs: https://cloud.google.com/architecture/framework/cost-optimization/cloudops#logging

Logs in BigQuery: https://console.cloud.google.com/bigquery?invt=AbjFiQ&project=bridgy-federated&inv=1&ws=%211m0

Logs dashboard: https://console.cloud.google.com/monitoring/dashboards/builder/24c22d42-91d8-4feb-aa6b-99dbb84c6417;duration=PT8H?project=bridgy-federated

image
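
One generic lever on ingest volume, separate from anything Bridgy Fed specific: raise the log level for the noisiest loggers in prod so less gets written in the first place. A minimal sketch using the standard logging module; the GAE_ENV check and the logger names are illustrative assumptions, not necessarily the right ones for this app.

import logging
import os

# Only quiet things down in prod; keep full debug logs in local dev.
# The GAE_ENV check and the logger names below are assumptions for illustration.
if os.environ.get('GAE_ENV', '').startswith('standard'):
    logging.getLogger().setLevel(logging.INFO)
    # silence per-request chatter from especially verbose libraries
    for noisy in ('urllib3', 'google.cloud.ndb', 'oauthlib'):
        logging.getLogger(noisy).setLevel(logging.WARNING)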


snarfed commented Aug 2, 2024

The next cost to look at is CPU. router is currently on four cores, atproto-hub on one. We should be able to get both down. Here's one place to start: https://cloud.google.com/profiler/docs

image
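
For reference, wiring the Cloud Profiler agent into a Python service is only a few lines. A sketch, assuming the google-cloud-profiler package is installed and the service account has the profiler agent role; 'router' as the service name is just an example:

import logging

import googlecloudprofiler

try:
    # starts the profiling agent, which samples CPU and wall time in the background
    googlecloudprofiler.start(service='router', service_version='live', verbose=1)
except (ValueError, NotImplementedError) as e:
    # profiling is nice to have; don't let it take the service down
    logging.warning('Cloud Profiler failed to start: %s', e)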


snarfed commented Aug 3, 2024

Next step for datastore reads: snarfed/arroba#30


snarfed commented Aug 9, 2024

Results from the datastore reads optimization here have been disappointing. I cut them by ~3x, but spend on datastore read API calls only went down maybe 1/4-1/3, from $10-12/d to $7-8/d. Hrmph.

image image

(Datastore reads are the top blue part of the cost bars above.)


snarfed commented Aug 13, 2024

Another opportunity here could be reducing router and atproto-hub allocated CPU. Router CPU is down to mostly under two cores, atproto-hub to maybe half a core. Could either drop router from 4 to 2, or leave it at 4 and merge atproto-hub into it.

image


snarfed commented Aug 15, 2024

^ The difficulty with merging router and atproto-hub is that we currently run four WSGI workers for router, and we want to run the atproto-hub threads on just one of them. I had trouble running all the threads in just one worker on router a while back, but that may have been a red herring; it may actually work fine after all, and I doubt we're compute-bound enough for the GIL to matter. Worth trying.
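
One possible way to pin the hub threads to exactly one of the merged service's workers, sketched below: each worker tries to claim an advisory lock in memcache at startup, and only the winner starts the threads. start_hub_threads and the lock key are hypothetical names, and a real version would need to handle the lock expiring or the winning worker dying.

import os
import threading

from common import memcache  # the pymemcache PooledClient configured in common.py

HUB_LOCK_KEY = 'atproto-hub-worker'  # hypothetical key
HUB_LOCK_EXPIRE = 60 * 60            # advisory only; reclaimable after an hour

def start_hub_threads():
    # stand-in for whatever actually starts the atproto-hub loops
    ...

def maybe_start_hub():
    # add() only succeeds if the key doesn't already exist, so exactly one
    # worker wins the claim. noreply=False so we actually see the result.
    if memcache.add(HUB_LOCK_KEY, str(os.getpid()), expire=HUB_LOCK_EXPIRE,
                    noreply=False):
        threading.Thread(target=start_hub_threads, daemon=True).start()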


snarfed commented Aug 15, 2024

Another idea here: stop storing and fetching transient activities (create, update, delete, undo, accept/reject, etc) in the datastore entirely. Just use memcache for them.
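
Roughly what that could look like: write transient activities straight to memcache with a TTL instead of creating Object entities for them. A sketch only; cache_activity/load_activity and the one-day TTL are made up for illustration, and this ignores everything that currently reads those activities back out of the datastore.

import json

from common import memcache  # the pymemcache PooledClient configured in common.py

TRANSIENT_TYPES = ('create', 'update', 'delete', 'undo', 'accept', 'reject')
TRANSIENT_EXPIRE = 60 * 60 * 24  # 1 day, illustrative

def cache_activity(id, activity_as1):
    """Stores a transient activity in memcache only, keyed by its id."""
    # ids longer than memcache's 250 char key limit would need hashing
    memcache.set(f'activity:{id}', json.dumps(activity_as1),
                 expire=TRANSIENT_EXPIRE)

def load_activity(id):
    """Returns the cached activity as a dict, or None if it's gone."""
    cached = memcache.get(f'activity:{id}')
    return json.loads(cached) if cached else None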


snarfed commented Aug 15, 2024

...but looking at our incoming activity types, that might not make a big difference. The bulk of incoming activities are likes, follows, and reposts, in that order.

Measure first, then optimize.

image

(I should double check to make sure that count isn't after various filtering though!)


snarfed commented Dec 5, 2024

Another idea here: cache follower counts in memcache instead of instance memory. Not sure how much that would matter; I'll see when I dig into datastore usage.


snarfed commented Dec 9, 2024

Yet another idea here: stop storing incoming activities altogether, and only store objects.


snarfed commented Dec 9, 2024

Oh wow, I didn't realize that the datastore had separate pricing for multi-region vs single-region, https://cloud.google.com/datastore/pricing#location_pricing . Let's see if I can figure out how to switch us to single-region.


snarfed commented Dec 10, 2024

OK, so right now the App Engine app is in us-central (see the dashboard or gcloud app describe), and the Datastore db is nam5, ie the us-central multi-region (see the dashboard).

The catch may be that datastore says it doesn't support App Engine in us-central1, argh.


snarfed commented Dec 10, 2024

Sadly we may be too late here; evidently you can't change a datastore db's location after it's created. I can use a regional endpoint for requests, but I don't think that will affect storage locality or pricing.


snarfed commented Dec 10, 2024

Can't get a regional endpoint to work anyway.

from google.api_core.client_options import ClientOptions
from google.cloud import ndb

from web import Web  # Bridgy Fed's Web user model (module path assumed)

with ndb.Client(client_options=ClientOptions(api_endpoint='https://us-east1.googleapis.com')).context():
    w = Web.get_by_id('snarfed.org')
...
google.api_core.exceptions.RetryError: Maximum number of 3 retries exceeded while calling <function make_call.<locals>.rpc_call at 0x108723f60>, last exception: 503 DNS resolution failed for https://us-east1-firestore.googleapis.com:443: C-ares status is not ARES_SUCCESS qtype=A name=https://us-east1-firestore.googleapis.com:443 is_balancer=0: Domain name not found

Also tried variations like us-east1-firestore, us-east1-datastore, and us-central1[-*].

Dropping the https:// got a google.api_core.exceptions.PermissionDenied: 403 Not authorized instead of Domain not found, which seems closer, but still not there yet. Who knows.


snarfed commented Dec 10, 2024

OK, redoing datastore API call stats:

image

Lookups by kind:

image

Object lookups:

image

Queries by kind and composite filters:

image

Huh, the vast majority of queries are still for original users/objects based on copy ids:

image


snarfed commented Dec 10, 2024

Top lookups by id are all ATProto; we're still not caching DID docs or records enough.

image

snarfed added a commit that referenced this issue Dec 12, 2024
adds new generic common.memcache_memoize decorator for caching any function's output in memcache. for #1149

snarfed commented Dec 12, 2024

Started caching queries on User/Object.copies last night ^. It helped, but not as much as I'd hoped: only maybe 25-30%, down from ~70qps to ~50qps. 😕

image
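
For context, the decorator in that commit can be pretty small; here's a minimal sketch of the idea. This is not the actual common.memcache_memoize implementation: the key format, expiration default, and None handling are all guesses.

import functools
import hashlib

from common import pickle_memcache  # the PickleSerde pymemcache client from common.py

def memcache_memoize(expire=60 * 60):
    """Memoizes a function's return value in memcache, keyed by its arguments."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapped(*args, **kwargs):
            # hash the key so it stays under memcache's length limit and never
            # contains spaces
            raw = f'{fn.__qualname__} {args} {sorted(kwargs.items())}'
            key = 'memoize-' + hashlib.sha256(raw.encode()).hexdigest()

            cached = pickle_memcache.get(key)
            if cached is not None:
                return cached

            val = fn(*args, **kwargs)
            if val is not None:  # naive: this sketch can't cache a None result
                pickle_memcache.set(key, val, expire=expire)
            return val

        return wrapped
    return decorator

A hypothetical use for the copies.uri queries above would be decorating the lookup, eg @memcache_memoize(expire=2 * 60 * 60) on a get_original(copy_uri) style helper.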

snarfed added a commit that referenced this issue Dec 13, 2024
snarfed added a commit that referenced this issue Dec 15, 2024
sad, it's useful, but it's too expensive. for #1501, #1149

snarfed commented Dec 16, 2024

Looking back at the last 2w, we've made a bit more progress here on datastore calls than I thought. Feeling a bit better now. Still lots more to do though!

image


snarfed commented Dec 19, 2024

Added some logging on datastore lookups; we're doing tons without caching, as expected. A few stack traces:

default (frontend): did:plc:ljb6ugearscroyatvai5kkfr:

  File "/workspace/activitypub.py", line 1007, in actor
    user = _load_user(handle_or_id, create=True)
  File "/workspace/activitypub.py", line 985, in _load_user
    user = proto.get_or_create(id) if create else proto.get_by_id(id)
  File "/workspace/models.py", line 379, in get_or_create
    user = _run()
  File "/workspace/models.py", line 302, in _run
    user = cls.get_by_id(id, allow_opt_out=True)
  File "/workspace/models.py", line 270, in get_by_id
    user = cls._get_by_id(id, **kwargs)
  File "/workspace/activitypub.py", line 1007, in actor
    user = _load_user(handle_or_id, create=True)
  File "/workspace/activitypub.py", line 985, in _load_user
    user = proto.get_or_create(id) if create else proto.get_by_id(id)
  File "/workspace/models.py", line 379, in get_or_create
    user = _run()
  File "/workspace/models.py", line 302, in _run
    user = cls.get_by_id(id, allow_opt_out=True)
  File "/workspace/models.py", line 276, in get_by_id
    elif user.status and not allow_opt_out:
  File "/workspace/models.py", line 475, in status
    if not self.obj or not self.obj.as1:
  File "/workspace/models.py", line 973, in as1
    handle = ATProto(id=owner).handle
  File "/workspace/atproto.py", line 257, in handle
    return did_to_handle(self.key.id())
  File "/workspace/atproto.py", line 183, in did_to_handle
    if did_obj := ATProto.load(did, did_doc=True, remote=remote):
  File "/workspace/atproto.py", line 746, in load
    return super().load(id, **kwargs)
  File "/workspace/protocol.py", line 1566, in load
    obj = Object.get_by_id(id)

router: did:plc:rvu67iewv37qcskwd22qe7i4:

  File "/workspace/protocol.py", line 1699, in receive_task
    return PROTOCOLS[obj.source_protocol].receive(
  File "/workspace/protocol.py", line 909, in receive
    obj = Object.get_or_create(id, authed_as=actor, **orig.to_dict())
  File "/workspace/models.py", line 1114, in get_or_create
    obj.put()
  File "/workspace/models.py", line 1047, in _pre_put_hook
    if self.as1 and self.as1.get('objectType') == 'activity':
  File "/workspace/models.py", line 973, in as1
    handle = ATProto(id=owner).handle
  File "/workspace/atproto.py", line 257, in handle
    return did_to_handle(self.key.id())
  File "/workspace/atproto.py", line 183, in did_to_handle
    if did_obj := ATProto.load(did, did_doc=True, remote=remote):
  File "/workspace/atproto.py", line 746, in load
    return super().load(id, **kwargs)
  File "/workspace/protocol.py", line 1566, in load
    obj = Object.get_by_id(id)
  File "/workspace/protocol.py", line 1699, in receive_task
    return PROTOCOLS[obj.source_protocol].receive(
  File "/workspace/protocol.py", line 935, in receive
    Object.get_or_create(inner_obj_id, our_as1=inner_obj_as1,
  File "/workspace/models.py", line 1083, in get_or_create
    orig_as1 = obj.as1
  File "/workspace/models.py", line 973, in as1
    handle = ATProto(id=owner).handle
  File "/workspace/atproto.py", line 257, in handle
    return did_to_handle(self.key.id())
  File "/workspace/atproto.py", line 183, in did_to_handle
    if did_obj := ATProto.load(did, did_doc=True, remote=remote):
  File "/workspace/atproto.py", line 746, in load
    return super().load(id, **kwargs)
  File "/workspace/protocol.py", line 1566, in load
    obj = Object.get_by_id(id)


snarfed commented Dec 19, 2024

No good reason, but I vaguely suspect our patch for the "Key has already been set in this batch" error, from googleapis/python-ndb#743 (comment), below. I'll try disabling it now, along with our tasklets that hydrate objects and authors in pages.serve_feed, to see what happens.

bridgy-fed/flask_app.py

Lines 65 to 101 in b4270a5

# https://github.com/googleapis/python-ndb/issues/743#issuecomment-2067590945
#
# fixes "RuntimeError: Key has already been set in this batch" errors due to
# tasklets in pages.serve_feed
from logging import error as log_error
from sys import modules

from google.cloud.datastore_v1.types.entity import Key
from google.cloud.ndb._cache import (
    _GlobalCacheSetBatch,
    global_compare_and_swap,
    global_set_if_not_exists,
    global_watch,
)
from google.cloud.ndb.tasklets import Future, Return, tasklet

GLOBAL_CACHE_KEY_PREFIX: bytes = modules["google.cloud.ndb._cache"]._PREFIX
LOCKED_FOR_READ: bytes = modules["google.cloud.ndb._cache"]._LOCKED_FOR_READ
LOCK_TIME: bytes = modules["google.cloud.ndb._cache"]._LOCK_TIME


@tasklet
def custom_global_lock_for_read(key: str, value: str):
    if value is not None:
        yield global_watch(key, value)
        lock_acquired = yield global_compare_and_swap(
            key, LOCKED_FOR_READ, expires=LOCK_TIME)
    else:
        lock_acquired = yield global_set_if_not_exists(
            key, LOCKED_FOR_READ, expires=LOCK_TIME)

    if lock_acquired:
        raise Return(LOCKED_FOR_READ)


modules["google.cloud.ndb._cache"].global_lock_for_read = custom_global_lock_for_read


snarfed commented Dec 19, 2024

Nope. (Dashed line marks when this ^ was deployed.)

image


snarfed commented Dec 20, 2024

Still struggling to instrument datastore lookups and understand why they're not getting cached. Logging on the source side of ndb lookups, via google.cloud.ndb._datastore_api, definitely seems to confirm they're not getting cached right, in either the frontend or router. I also added logging to google.cloud.ndb.global_cache in snarfed/python-ndb@d1d399f; I see it locally, but not in prod at all. 😕 Maybe the build isn't picking up that branch from GitHub right?


snarfed commented Dec 21, 2024

Here's everything I can think of that I've done to get ndb caching in memcache working, or to even check that it's working:

  • Configured ndb to use memcache for its global cache:
    MEMCACHE_HOST: '10.126.144.3'

    bridgy-fed/common.py

    Lines 108 to 118 in 382b30e

    if memcache_host := os.environ.get('MEMCACHE_HOST'):
        logger.info(f'Using real memcache at {memcache_host}')
        memcache = pymemcache.client.base.PooledClient(
            memcache_host, timeout=10, connect_timeout=10,  # seconds
            allow_unicode_keys=True)
        pickle_memcache = pymemcache.client.base.PooledClient(
            memcache_host, timeout=10, connect_timeout=10,  # seconds
            serde=PickleSerde(), allow_unicode_keys=True)
        # ideally we'd use MemcacheCache.from_environment, but it doesn't let us
        # pass kwargs like serde to the pymemcache client constructor
        global_cache = MemcacheCache(memcache, strict_read=True)

    bridgy-fed/common.py

    Lines 484 to 491 in 382b30e

    NDB_CONTEXT_KWARGS = {
        # limited context-local cache. avoid full one due to this bug:
        # https://github.com/googleapis/python-ndb/issues/888
        'cache_policy': cache_policy,
        'global_cache': global_cache,
        'global_cache_policy': global_cache_policy,
        'global_cache_timeout_policy': global_cache_timeout_policy,
    }
  • Enabled ndb debug logging with NDB_DEBUG=true and logging.getLogger('google.cloud.ndb.global_cache').setLevel(logging.DEBUG).
  • Checked the build to confirm that it's using my ndb fork and branch with global_cache logging.
  • Ran unit tests locally with eg env MEMCACHE_HOST=localhost NDB_DEBUG=true python -m unittest -v ..., confirmed that I see google.cloud.ndb.global_cache output.

I still don't see google.cloud.ndb.global_cache log lines in prod, and I still see repeated datastore lookups for the same key.
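
One more low-level check that might help narrow this down: from a shell on a prod instance (or a one-off admin handler), talk to the memcache host directly and inspect the live ndb context, to rule out a networking or wiring mismatch. A sketch; it assumes the MEMCACHE_HOST env var is set and that it runs inside an ndb context.

import os

import pymemcache.client.base
from google.cloud import ndb

# 1. can this instance reach memcache at all?
client = pymemcache.client.base.Client(os.environ['MEMCACHE_HOST'],
                                        connect_timeout=5, timeout=5)
client.set(b'bridgy-fed-cache-check', b'ok', expire=60)
print('direct memcache roundtrip:', client.get(b'bridgy-fed-cache-check'))

# 2. is the running ndb context actually wired up to that global cache?
print('ndb global cache in use:', ndb.get_context().global_cache)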
