If we only have one database in one location then that wouldn't work, right? We'd need a distributed solution so what could we use? For serverless functions to start maybe we use Cloudflare workers? Maybe we'd then have a strategy for moving off cloud and towards buying/renting more datacenter space as we scale?
I'm interested in hearing thoughts about this. If you were to build a fast search engine that could scale to millions of users today, how would you do it?
Issue with search is there's a lot of data, and a lot of compute to run over it, so you end up replicating PBs just to reduce latency.
For database, first I'd lean on plain file s3-like storage as much as possible for spidering and indexing data and try to keep the user-facing database under a couple hundred terabytes, I'd probably favor some bare metal at this point but that will fit on some "db as a service" providers.