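See below the original question for some test comparisons of the different approaches:
So far I have tried three approaches:
1. Iterating over the directory with the code from "Get Folder Size from Windows Command Line":
@echo off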
set size=0
for /r %%x in (folder\*) do set /a size+=%%~zx
echo %size% Bytes
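2. Saving the output of 'dir %folder% /s /a' to a text file and then reading the size at the bottom.
3. The last approach I am trying now is du (the Disk Usage tool from MS - https://technet.microsoft.com/en-us/sysinternals/bb896651.aspx).
Apart from #3, these approaches all seem too slow for what I need (thousands of files). So the question is which of them is, or should be, the fastest, and whether there is any other fast(er) way to get the size of the contents of a folder with 100k+ files (and there are 100 such folders).
Below is my very hacky way of comparing them (some of my program's output got mangled). It has some small bugs: some parts, such as option 3, will fail because they try to handle numbers above the 32-bit limit, and I am sure there are other problems, but I think the overall timings are clear unless I really messed up my logic.
Option I: iterate through the directories, use a VBScript to read the text output of 'dir' and find the size at the end, then convert it to MB (I originally got this from somewhere else and have lost track of where)
Option II: iterate with a findstr pipe and output the result directly (no conversion to MB) - from @MC ND
Option III: iterate with the compact command - from @npocmaka
Option IV: from @user1016274 - using robocopy
(There were more answers, but these are the ones I was able to incorporate.)
These are the results I got. They were quite consistent between runs, and robocopy blew the others away:
Option I and Option II are usually close, with Option II slightly better (both take 1 min 10 s to 2 min 10 s, not sure where the variation comes from)
Option III - 16-17 minutes
Option IV - 10-20 seconds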
@echo OFF
setlocal enabledelayedexpansion
REM OPTION I - directory iteration
REM OPTION II - iteration with findstr pipe
REM OPTION III - compact
REM OPTION IV - robocopy
:MAIN
REM Initialize log filename
REM The substring offsets below assume a date format like ddd MM/dd/yyyy; adjust them for other locales
set LOGFILEPOSTFIX=%date:~10,4%%date:~4,2%%date:~7,2%%time:~0,2%%time:~3,2%%time:~6,2%
set TIMESTAMP=%date:~10,4%_%date:~4,2%_%date:~7,2%_%time:~0,2%_%time:~3,2%_%time:~6,2%
echo %TIMESTAMP%
set "LOGFILE=Proj_not_in_db_%LOGFILEPOSTFIX%.log"
set option=1
set TIMESTAMP=%date:~10,4%_%date:~4,2%_%date:~7,2%_%time:~0,2%_%time:~3,2%_%time:~6,2%
echo %TIMESTAMP% - PART I ---- Directory Listing into file, iterate through the sizes of all files inside folder >> %LOGFILE%
echo %TIMESTAMP% - PART I
call :PROCESSFOLDER
set TIMESTAMP=%date:~10,4%_%date:~4,2%_%date:~7,2%_%time:~0,2%_%time:~3,2%_%time:~6,2%
echo %TIMESTAMP% - PART I ---- END >> %LOGFILE%
echo %TIMESTAMP% - PART I - END
set option=2
set TIMESTAMP=%date:~10,4%_%date:~4,2%_%date:~7,2%_%time:~0,2%_%time:~3,2%_%time:~6,2%
echo %TIMESTAMP% - PART II findstr pipe ---- >> %LOGFILE%
echo %TIMESTAMP% - PART II
call :PROCESSFOLDER
set TIMESTAMP=%date:~10,4%_%date:~4,2%_%date:~7,2%_%time:~0,2%_%time:~3,2%_%time:~6,2%
echo %TIMESTAMP% - PART II ---- END>> %LOGFILE%
echo %TIMESTAMP% - PART II - END
set option=3
set TIMESTAMP=%date:~10,4%_%date:~4,2%_%date:~7,2%_%time:~0,2%_%time:~3,2%_%time:~6,2%
echo %TIMESTAMP% - PART III compact ---- >> %LOGFILE%
echo %TIMESTAMP% - PART III
call :PROCESSFOLDER
set TIMESTAMP=%date:~10,4%_%date:~4,2%_%date:~7,2%_%time:~0,2%_%time:~3,2%_%time:~6,2%
echo %TIMESTAMP% - PART III ---- END>> %LOGFILE%
echo %TIMESTAMP% - PART III - END
set option=4
set TIMESTAMP=%date:~10,4%_%date:~4,2%_%date:~7,2%_%time:~0,2%_%time:~3,2%_%time:~6,2%
echo %TIMESTAMP% - PART IV robocopy ---- >> %LOGFILE%
echo %TIMESTAMP% - PART IV
call :PROCESSFOLDER
call :CLEANUP
echo FINAL
pause
goto :EOF
:PROCESSFOLDER
echo C:\Windows
echo Processing C:\Windows >> %LOGFILE%
break > projects_in_folder.tmp
for /f "tokens=1-4,* SKIP=7" %%b IN ('dir "C:\Windows" /Q /TW /AD') do (
set _folder=%%f
REM Don't write the 2 lines at the end displaying summary information
if NOT "%%e" EQU "bytes" (
SET _folder=!_folder:~23!
echo !_folder!,%%b>> projects_in_folder.tmp
)
)
set "folder_path=C:\Windows"
call :COMPARE
goto :EOF
:COMPARE
set file_name=%folder_path:\=_%
break > "%file_name%.txt"
if %option%==4 (
set "full_path=C:\Windows"
call :GETFOLDERINFO4
REM Use delayed expansion so the timestamp is read after robocopy finishes, not when this block is parsed
set "TIMESTAMP=!date:~10,4!_!date:~4,2!_!date:~7,2!_!time:~0,2!_!time:~3,2!_!time:~6,2!"
echo !TIMESTAMP! - PART IV ---- END>> %LOGFILE%
echo !TIMESTAMP! - PART IV - END
)
for /f "tokens=1,2* delims=," %%a in (projects_in_folder.tmp) do (
for /f "tokens=1,* delims=_" %%x in ("%%a") do (
set "projcode=%%x"
)
set full_path=%folder_path%\%%a
if %option%==1 call :GETFOLDERINFO
if %option%==2 call :GETFOLDERINFO2
if %option%==3 call :GETFOLDERINFO3
echo PROJ: %%a SIZE: !totalsize! LASTMODIFIED: %%b >> %LOGFILE%
)
goto :EOF
:GETFOLDERINFO2
set "size=0"
set target=!full_path!
for /f "tokens=3,5" %%a in ('
dir /a /s /w /-c "%target%"
^| findstr /b /l /c:" "
') do if "%%b"=="" set "size=%%a"
echo %size%
set totalsize=%size%
goto :EOF
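:GETFOLDERINFO4
pushd "%full_path%" || goto :EOF
REM /L only lists, nothing is copied. The third space-delimited token of the Bytes summary line is the total size
for /f "tokens=1-10,* delims= " %%a in ('
robocopy "%full_path%" "%TEMP%" /S /L /BYTES /XJ /NFL /NDL /NJH ^| find "Bytes"
') do echo %full_path%: %%c
popd
goto :EOF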
:GETFOLDERINFO
set totalsize=0
dir "%full_path%" /s /a > size.txt
REM Run the VBScript once. It prints the size in MB, which is captured into totalsize
pushd "%~dp0"
FOR /F "usebackq tokens=*" %%r in (`CSCRIPT "foldersize.vbs"`) DO SET totalsize=%%r
goto :EOF
:GETFOLDERINFO3
set "last=#"
set "_size="
for /f "tokens=1 delims= " %%s in ('compact /s:"%full_path%" /q ') do (
set "_size=!last!"
set "last=%%s"
)
set "_size=%_size: =%"
set "_size=%_size: =%"
set "_size=%_size:.=%"
set "_size=%_size:,=%"
set "_size=%_size: =%"
echo folder size is : %_size% bytes
set totalsize=%_size%
goto :EOF
:CLEANUP
DEL /Q /S projects_in_folder.tmp
DEL /Q /S size.txt
goto :EOF
Answer 0 (score: 6)
You can try this (in the spirit of your second case):
@echo off
setlocal enableextensions disabledelayedexpansion
set "target=%~1"
if not defined target set "target=%cd%"
set "size=0"
for /f "tokens=3,5" %%a in ('
dir /a /s /w /-c "%target%"
^| findstr /b /l /c:" "
') do if "%%b"=="" set "size=%%a"
echo %size%
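To make the selection logic of that filter easier to follow outside of cmd, here is a minimal Python sketch of the same idea: keep only lines that start with a space, look at lines with at least three but fewer than five fields, and let the last such line win. The function name `total_from_dir_output` and the sample listing are made up for illustration; the sample only mimics English `dir /a /s /w /-c` output, and real output differs by locale and Windows version.

```python
# Illustrative sample only: it imitates English "dir /a /s /w /-c" output.
SAMPLE_DIR_OUTPUT = r"""
 Volume in drive C has no label.
 Volume Serial Number is 1234-ABCD

 Directory of C:\data

[.]        [..]       a.txt      [sub]
               1 File(s)           1200 bytes

 Directory of C:\data\sub

[.]        [..]       b.bin
               1 File(s)           3400 bytes

     Total Files Listed:
               2 File(s)           4600 bytes
               3 Dir(s)     99999999999 bytes free
"""


def total_from_dir_output(text):
    """Return the grand total in bytes, or None if no total line is found."""
    size = None
    for line in text.splitlines():
        # Same role as: findstr /b /l /c:" "  (line must begin with a space)
        if not line.startswith(" "):
            continue
        fields = line.split()
        # Same role as: tokens=3,5 with the test that token 5 is empty
        if 3 <= len(fields) < 5:
            size = fields[2]  # later matches overwrite earlier ones
    if size is not None and size.isdigit():
        return int(size)
    return None


if __name__ == "__main__":
    print(total_from_dir_output(SAMPLE_DIR_OUTPUT))
```

The "Dir(s) ... bytes free" line has five fields, so it is skipped and the preceding "File(s)" total is what remains.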
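Answer 1 (score: 5)
After some testing and comparing the performance of dir /s, compact /s and PowerShell Get-ChildItem, I found that using robocopy is much faster. Another advantage is that even very long paths (more than about 256 characters), as found in deeply nested folders, do not cause errors.
If you would rather not count data behind junctions, that is easy to arrange with robocopy, like this: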
@echo off
pushd "%~1" || goto :EOF
REM delims includes ":" so that the size is token 2 whether the line reads "Bytes :" or "Bytes:"
for /f "tokens=2 delims=: " %%a in ('
robocopy "%CD%" "%TEMP%" /S /L /BYTES /XJ /NFL /NDL /NJH /R:0 ^| find "Bytes"
') do echo %CD%: %%a
popd
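If you omit the /BYTES option, you get the size formatted in MB or GB. In that case the unit has to be printed as well (k, m, g, t for kilo, mega, giga, tera), using a second loop variable: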
REM Needs delayed expansion (setlocal enabledelayedexpansion) for the !dim! references
for /f "tokens=2-3 delims=: " %%a in ('
robocopy "%CD%" "%TEMP%" /S /L /XJ /NFL /NDL /NJH /R:0 ^| findstr "Bytes"
') do (
set dim=%%b
set "dim=!dim:k=KB!" & set "dim=!dim:m=MB!" & set "dim=!dim:g=GB!" & set "dim=!dim:t=TB!"
if !dim! EQU %%b set dim=B
echo ^ %CD%: %%a !dim!
)
%%b contains either the unit letter or a numeric value. This is tested by string substitution in order to avoid the 32-bit limit of set /A.
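If the summary line is post-processed outside of cmd, the unit letter can be turned back into a byte count without running into any 32-bit limit. The following is a minimal Python sketch, not part of the answer above. The function name `parse_bytes_line` is hypothetical, and two assumptions are baked in: that the unit letters are powers of 1024, and that the number uses "." as its decimal separator.

```python
import re

# Assumption: robocopy's k/m/g/t suffixes are powers of 1024.
UNITS = {"k": 1024, "m": 1024 ** 2, "g": 1024 ** 3, "t": 1024 ** 4}


def parse_bytes_line(line):
    """Convert a robocopy 'Bytes' summary line to an (approximate) byte count."""
    # Split on colons and whitespace, similar to for /f "delims=: "
    fields = [f for f in re.split(r"[:\s]+", line) if f]
    if len(fields) < 2 or fields[0] != "Bytes":
        raise ValueError("not a robocopy Bytes summary line: %r" % line)
    value = float(fields[1])
    # The next field is either a unit letter or the next numeric column.
    unit = fields[2].lower() if len(fields) > 2 else ""
    # Python integers are arbitrary precision, so large totals do not overflow.
    return int(value * UNITS.get(unit, 1))


if __name__ == "__main__":
    print(parse_bytes_line("   Bytes :   1.97 m         0    1.97 m         0"))
```

Because the abbreviated value is rounded by robocopy, the result is only as precise as the printed digits; with /BYTES the line already carries the exact count.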
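Answer 2 (score: 3)
Since you are willing to use VBScript (based on the comments below your question), you can simply use the Size property of the FileSystemObject Folder object. It reports the total size of all files in the folder, including files in all subfolders (recursively).
The following simple JScript script prints the size of the current folder: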
var fso = new ActiveXObject("Scripting.FileSystemObject");
WScript.Echo(fso.GetFolder('.').Size);
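I chose JScript over VBScript because embedding JScript in a batch script is simple (although there are ways to do the same with VBScript).
Here is a simple hybrid script utility that reports the total size of whatever path you pass in as the first and only argument. The hybrid script makes the call very convenient, because you do not have to specify CSCRIPT.
FolderSize.bat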
@if (@X)==(@Y) @end /* Harmless hybrid line that begins a JScript comment
::FolderSize.bat FolderPath
::
:: Print the total size of all files within FolderPath,
:: including all sub-folders, recursively.
::******** Batch Code *********
@echo off
cscript //nologo //e:jscript "%~f0" %1
exit /b
********** JScript Code *******/
var fso = new ActiveXObject("Scripting.FileSystemObject");
WScript.Echo(fso.GetFolder(WScript.Arguments.Unnamed(0)).Size);
The only restriction is that you must have access to all folders (and files?) within the folder, otherwise it fails with an error message.
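One way around that access restriction is to walk the tree yourself and skip whatever cannot be read. The sketch below uses Python rather than the FileSystemObject, so it is an alternative technique and not a drop-in replacement; the function name `folder_size` is made up for this example. It does not follow symbolic links, and unreadable folders or files are silently left out of the total.

```python
import os


def folder_size(path):
    """Sum the sizes of all regular files below path, skipping unreadable entries."""
    total = 0
    stack = [path]  # explicit stack instead of recursion, for very deep trees
    while stack:
        current = stack.pop()
        try:
            with os.scandir(current) as entries:
                for entry in entries:
                    try:
                        if entry.is_dir(follow_symlinks=False):
                            stack.append(entry.path)
                        elif entry.is_file(follow_symlinks=False):
                            total += entry.stat(follow_symlinks=False).st_size
                    except OSError:
                        # Entry vanished or access was denied: leave it out.
                        continue
        except OSError:
            # Folder missing or not readable: skip it and carry on.
            continue
    return total
```

Calling `folder_size(".")` returns the total for the current folder as an integer number of bytes. Since errors are swallowed, a result of 0 can mean either an empty folder or one that could not be opened.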
Answer 3 (score: 1)
Try this:
:foldersize
@echo off
pushd "%~1"
setlocal
set "_size="
for /f "tokens=1 delims=t" %%s in ('compact /s /q ^|find " total bytes"') do (
set "_size=%%s"
)
set "_size=%_size: =%"
set "_size=%_size: =%"
set "_size=%_size:.=%"
set "_size=%_size:,=%"
set "_size=%_size: =%"
echo folder size is : %_size% bytes
endlocal
popd
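It accepts one argument - the folder. compact /s /q (/q is for reporting, so no changes are applied) produces less output and has a chance of being faster than DIR.
Edit: some optimized variants (one is based on @MC ND's - it is probably faster). The idea is to skip FIND or FINDSTR, because they are external programs and slow the script down: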
:foldersize
@echo off
pushd "%~1"
setlocal enableDelayedExpansion
set "last=#"
set "_size="
for /f "tokens=1 delims= " %%s in ('compact /s /q') do (
set "_size=!last!"
set "last=%%s"
)
set "_size=%_size: =%"
set "_size=%_size: =%"
set "_size=%_size:.=%"
set "_size=%_size:,=%"
set "_size=%_size: =%"
echo folder size is : %_size% bytes
endlocal
popd
and
@echo off
:original script by MC ND
setlocal enableextensions enableDelayedExpansion
set "target=%~1"
if not defined target set "target=%cd%"
set "size=0"
set "last=#"
set "pre_last=#"
rem set "pre_pre_last=#"
for /f "tokens=3" %%a in ('
dir /a:-d /s /w /-c "%target%"
') do (
set "pre_last=!last!"
set "last=%%a"
)
echo !pre_last!
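Answer 4 (score: 0)
I think looping over every line of output from the compact or dir command is inefficient, and it can be avoided by filtering the intermediate results: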
@echo off
REM dirsize.cmd 2015-05-29
pushd "%~1" || goto :EOF
setlocal
for /f "tokens=1-3*" %%A in ('compact /s /a /q ^| find "Datenbytes" ^| find /v "Auflistung"') do echo %CD%: %%A %%B %%C
popd
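Changes:
- the script terminates if the given path does not exist, where it would otherwise scan the current directory
- compact /a is used to include hidden and system files
- the complete output is piped through find. This requires a locale-dependent search string to filter out the summary line. In German it is "Datenbytes", but that string could also be contained in a folder name, so a second, negative filter suppresses those lines. Again this is locale dependent (but independence was not a requirement)
The advantage is that find discards output lines faster than a shell loop with variable assignment does. The cost of invoking it is negligible.
Note that compact /q will not prevent a compression operation. It only shortens the output. Calling compact without any switch that requests compression makes it only list the files/folders rather than compress them.
Edit: while these points are all valid IMHO, please see my other answer for a faster way.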
Answer 5 (score: 0)
If you are not averse to using PowerShell, here is a quick script you can use:
param([String]$path=".")
# -Recurse descends into subfolders; -Force includes hidden and system items
Get-ChildItem $path -Recurse -Force | Measure-Object -Property Length -Sum