Hi all, I'm running QuestDB on ZFS and noticed extremely high disk write activity (close to 1 GB/s) from a kernel thread, [flush-zfs-2], because of which my disk utilization sits at 100%.
Is this level of ZFS background flushing expected?
Hi Aditya,
Usually, ZFS doesn't write an unexpectedly high volume of data to disk.
What's your ingestion scenario? Are you inserting lots of rows? Which client/protocol do you use to insert data? Knowing your load might help us figure out what's going on with ZFS.
We are inserting approximately 2 MB of data (~7.5k rows) every 7 to 25 millis using ILP over TCP, with 18 concurrent connections writing in parallel to different tables.
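For context, each connection streams plain ILP text lines over TCP, roughly like the illustrative ones below (the table, symbol, and column names are made up for this example), about 7.5k of them per batch:

```
trades_01,venue=XNAS price=101.25,qty=7i 1718000000000000000
trades_01,venue=XNAS price=101.26,qty=3i 1718000000000000500
```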
We've also disabled ZFS sync on the dataset (sync=disabled), but it hasn't had any noticeable impact on performance or flushing behavior.
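For reference, this is how we disabled it (the pool/dataset name is illustrative):

```
# Assumed pool/dataset name; check yours with `zfs list`.
zfs set sync=disabled tank/questdb
zfs get sync tank/questdb    # verify the property is applied
```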
That's a relatively high ingestion rate. In our experience, ZFS becomes a bottleneck under high ingestion load. How important is compression in your case? If it's not terribly important, you should try ext4 or XFS. And if you're using a network-attached disk, like AWS EBS, make sure it's configured for the maximum possible throughput and IOPS.
I’m okay with using XFS for real-time ingestion to avoid performance issues, but I plan to migrate the data to a ZFS disk afterward. If compression is enabled on the ZFS dataset, will the data be compressed during migration? Will this setup work reliably — ingesting on XFS, then storing long-term on a separate compressed ZFS disk?
Since the streaming database can stay on XFS for performance, but the historical data needs to go on ZFS due to large storage requirements, what would be the best and most efficient way to handle the migration?
Also, what are the recommended settings for the XFS filesystem in a high-throughput write workload?
Using a 4 TB NVMe M.2 Gen4 drive (model: CT4000P3PSSD8).
With a single database instance this may be tricky, as you'd have to detach partitions, copy them to the ZFS volume, and attach them via a symlink:
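Very roughly, that flow would look something like the sketch below. Table, partition, and path names are illustrative, and you should double-check the exact directory suffixes against the ALTER TABLE DETACH/ATTACH PARTITION docs:

```sql
-- 1. Detach the partition from the hot table (illustrative table/partition).
ALTER TABLE trades DETACH PARTITION LIST '2024-06-01';
```

```
# 2. Move the detached folder to the ZFS volume, then symlink it back into the
#    table directory under the suffix the server expects for attachable partitions.
mv /var/lib/questdb/db/trades/2024-06-01.detached /zfs-pool/questdb/trades/2024-06-01
ln -s /zfs-pool/questdb/trades/2024-06-01 /var/lib/questdb/db/trades/2024-06-01.attachable
```

```sql
-- 3. Re-attach the partition; the data now lives on ZFS behind the symlink.
ALTER TABLE trades ATTACH PARTITION LIST '2024-06-01';
```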
We're working on native Apache Parquet support, which will provide a better compression ratio than ZFS. The idea is to support SQL statements for converting older partitions to/from Apache Parquet format, with encoding and compression enabled. It's hard to name an ETA for this feature, but hopefully it's ready in a few months.
As for special XFS tuning, so far the defaults have worked fine for us.
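In practice that means something as plain as the commands below (the device path and mount point are assumptions, and noatime is an optional nicety rather than a requirement):

```
# Default mkfs.xfs options; only the device and mount point are assumptions.
mkfs.xfs /dev/nvme0n1p1
mount -o noatime /dev/nvme0n1p1 /var/lib/questdb
```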
We do have IN VOLUME, but this is for entire tables only, not just some partitions. It also has some bugs when dropping and re-creating tables.
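For completeness, this is roughly what that feature looks like (the volume alias and paths are illustrative):

```
# server.conf: map a volume alias to a directory on the ZFS mount
cairo.volumes=zfs_cold->/zfs-pool/questdb-volumes
```

```sql
-- Create the historical table on the ZFS volume via the alias.
CREATE TABLE trades_hist (
    ts TIMESTAMP,
    venue SYMBOL,
    price DOUBLE,
    qty LONG
) TIMESTAMP(ts) PARTITION BY DAY
IN VOLUME 'zfs_cold';
```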
That being said, you could have a separate table on a ZFS volume with historical data, and another on XFS with real-time data. Then you periodically copy data across in bulk via INSERT INTO SELECT, and drop it from your real-time XFS table.
Then you can JOIN across both tables.
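A sketch of that periodic roll-over, reusing the illustrative trades (XFS) and trades_hist (ZFS) tables from above, with daily partitions:

```sql
-- Copy yesterday's data from the hot XFS table into the ZFS table in bulk.
INSERT INTO trades_hist
SELECT * FROM trades
WHERE ts IN '2024-06-01';

-- Then free the space on the XFS side by dropping the migrated partition.
ALTER TABLE trades DROP PARTITION LIST '2024-06-01';

-- Queries can still span both tables, e.g. via UNION ALL.
SELECT * FROM trades_hist WHERE ts IN '2024-06-01'
UNION ALL
SELECT * FROM trades WHERE ts >= '2024-06-02';
```

Whether you stitch the two together with UNION ALL or a join depends on the query; for identical schemas UNION ALL is usually the simpler option.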
These are the bugs I know of related to that feature:
Whether they would be dealbreakers, or worth working around in this case, is up to you!