LSM checkpointing and merging can fall far behind when many threads are inserting.
Part of the problem is that writing in-memory chunks to disk is inherently single-threaded.
Another part of the problem is the heuristic for throttling inserts: the current one stops being effective once more than a few threads are updating.