钻取配置:
blob
dfs
local
的本地磁盘文件夹虚拟机
Azure高级文件共享
dd
,1GB数据,各种块大小,conv = fdatasync进行测量sudo sh -c "echo 3 > /proc/sys/vm/drop_caches"
)之前,FS缓存已丢弃+-------+------------+--------+
| Mode | Block size | Speed |
+-------+------------+--------+
| Write | 1024 | 37MB/s |
| Write | 64 | 16MBs |
| Read | 1024 | 70MB/s |
| Read | 64 | 44MB/s |
+-------+------------+--------+
+-------+------------+---------+
| Mode | Block size | Speed |
+-------+------------+---------+
| Write | 1024 | 100MB/s |
| Write | 64 | 23MBs |
| Read | 1024 | 88MB/s |
| Read | 64 | 40MB/s |
+-------+------------+---------+
天蓝色Blob的最大已知吞吐量为60MB / s。上传/下载速度固定为目标存储的读写速度。
iotop
观察到的IO性能-- Query A: Reading parquet
select sum(`Price`) as test from namespace.`Parquet/**/*.parquet`;
-- Query B: Reading CSV
select sum(CAST(`Price` as DOUBLE)) as test from namespace.`sales.csv`;
+-------------+--------------------+----------+-----------------+
| Query | Source (namespace) | Duration | Disk read usage |
+-------------+--------------------+----------+-----------------+
| A (Parquet) | dfs(smb) | 14.8s | 2.8 - 3.5 MB/s |
| A (Parquet) | blob | 24.5s | N/A |
| A (Parquet) | local | 1.7s | 40 - 80 MB/s |
| --- | --- | --- | --- |
| B (CSV) | dfs(smb) | 22s | 30 - 60 MB/s |
| B (CSV) | blob | 29s | N/A |
| B (CSV) | local | 18s | 68 MB/s |
+-------------+--------------------+----------+-----------------+
cisfd
进程才具有IO性能。答案 0 :(得分:0)
我假设您将使用Azure Monitor交叉检查IO性能问题,并且如果问题仍然存在,我希望在此问题上紧密合作。这可能需要更深入的调查,因此,如果您有支持计划,请请求您提供支持票,否则,请告知我们,我们将尽力帮助您获得一次性的免费技术支持。在这种情况下,您是否可以参考此线程向AzCommunity [at] Microsoft [dot] com发送电子邮件。请在主题字段中提及“ ATTN subm”。感谢您在此问题上的合作,并期待您的答复。