我正在尝试查询数据库并将可能大到100万行(~200MB)的excel文件作为varbinary
存储并传递给验证器。
我们的构建服务器具有 6GB 的内存和负载平衡处理器,并且在运行期间无法将CPU或内存最大化。
然而,在大约40秒后,该过程抛出OutOfMemoryException
。
这是堆栈跟踪:
System.OutOfMemoryException: Exception of type 'System.OutOfMemoryException' was thrown.
at System.Data.SqlTypes.SqlBinary.get_Value()
at System.Data.SqlClient.SqlBuffer.get_ByteArray()
at System.Data.SqlClient.SqlBuffer.get_Value()
at System.Data.SqlClient.SqlDataReader.GetValueFromSqlBufferInternal(SqlBuffer data, _SqlMetaData metaData)
at System.Data.SqlClient.SqlDataReader.GetValueInternal(Int32 i)
at System.Data.SqlClient.SqlDataReader.GetValue(Int32 i)
at System.Data.SqlClient.SqlCommand.CompleteExecuteScalar(SqlDataReader ds, Boolean returnSqlValue)
at System.Data.SqlClient.SqlCommand.ExecuteScalar()
at eConfirmations.DataService.FileServices.FileDataService.GetFileContent(Guid fileId) in d:\w1\3\s\Source\eConfirmations.DataService\FileServices\FileDataService.cs:line 157
...
at System.Data.SqlTypes.SqlBinary.get_Value()
at System.Data.SqlClient.SqlBuffer.get_ByteArray()
at System.Data.SqlClient.SqlBuffer.get_Value()
at System.Data.SqlClient.SqlDataReader.GetValueFromSqlBufferInternal(SqlBuffer data, _SqlMetaData metaData)
at System.Data.SqlClient.SqlDataReader.GetValueInternal(Int32 i)
at System.Data.SqlClient.SqlDataReader.GetValue(Int32 i)
at System.Data.SqlClient.SqlCommand.CompleteExecuteScalar(SqlDataReader ds, Boolean returnSqlValue)
at System.Data.SqlClient.SqlCommand.ExecuteScalar()
at eConfirmations.DataService.FileServices.FileDataService.GetFileContent(Guid fileId) in d:\w1\3\s\Source\eConfirmations.DataService\FileServices\FileDataService.cs:line 157
这是我抛出异常的代码:
private byte[] GetFileContent(Guid fileId)
{
byte[] content;
string connectionString = ConfigurationManager.ConnectionStrings["eConfirmationsDatabase"].ConnectionString;
using (SqlConnection sqlConnection = new SqlConnection(connectionString))
{
using (SqlCommand sqlCommand = sqlConnection.CreateCommand())
{
sqlCommand.CommandTimeout = 300;
sqlCommand.CommandText = $"SELECT Content FROM dbo.[File] WHERE FileId = '{fileId}'";
sqlConnection.Open();
content = sqlCommand.ExecuteScalar() as byte[];
sqlConnection.Close();
sqlCommand.Dispose();
}
sqlConnection.Dispose();
}
return content;
}
是否有更有效的方法来撤回此数据,或者我们是否可以更新构建服务器上的设置以避免此错误?
答案 0 :(得分:2)
好的,这就是发生了什么:
因为它运行在32位版本上,所以最大内存分配为2GB,但我仍然无法接近该阈值。
根据与我的情况非常相似的this stackoverflow post,.NET框架将对象限制在内存中的256MB
限制。
因此即使我的文件只有200MB,byte[]
和MemoryStreams
也会以2的幂扩展,直到达到所需的256 MB为止。当它们展开时,它们会创建一个适当大小的新实例,并将旧数据复制到新实例,有效地将内存使用量乘以3 ,从而导致异常。
MSDN has an example of how to retrieve a large file using a FileStream,而不是FileStream,我使用this post预先初始化为我的数据大小的静态byte []。
以下是我的最终解决方案:
public File GetFileViaFileIdGuid(Guid fileId)
{
File file = new File();
string connectionString = ConfigurationManager.ConnectionStrings["Database"].ConnectionString;
using (var sourceSqlConnection = new SqlConnection(connectionString))
{
using (SqlCommand sqlCommand = sourceSqlConnection.CreateCommand())
{
sqlCommand.CommandText = $"SELECT FileName, FileExtension, UploadedDateTime, DATALENGTH(Content) as [ContentLength] FROM dbo.[File] WHERE FileId = '{fileId}'";
sqlCommand.CommandType = CommandType.Text;
sqlCommand.CommandTimeout = 300;
sourceSqlConnection.Open();
var reader = sqlCommand.ExecuteReader();
while (reader.Read())
{
file.FileId = fileId;
file.FileExtension = reader["FileExtension"].ToString();
file.FileName = reader["FileName"].ToString();
file.UploadedDateTime = (DateTime)reader["UploadedDateTime"];
file.Content = new byte[Convert.ToInt32(reader["ContentLength"])];
}
reader.Close();
sourceSqlConnection.Close();
}
}
file.Content = GetFileContent(file.FileId, file.Content.Length);
return file;
}
并获取内容:
private byte[] GetFileContent(Guid fileId, int contentLength)
{
int outputSize = 1048576;
int bufferSize = contentLength + outputSize;
byte[] content = new byte[bufferSize];
string connectionString = ConfigurationManager.ConnectionStrings["Database"].ConnectionString;
using (SqlConnection sqlConnection = new SqlConnection(connectionString))
{
using (SqlCommand sqlCommand = sqlConnection.CreateCommand())
{
sqlCommand.CommandTimeout = 300;
sqlCommand.CommandText = $"SELECT Content FROM dbo.[File] WHERE FileId = '{fileId}'";
sqlConnection.Open();
using (SqlDataReader reader = sqlCommand.ExecuteReader(CommandBehavior.SequentialAccess))
{
while (reader.Read())
{
int startIndex = 0;
long returnValue = reader.GetBytes(0, startIndex, content, startIndex, outputSize);
while (returnValue == outputSize)
{
startIndex += outputSize;
returnValue = reader.GetBytes(0, startIndex, content, startIndex, outputSize);
}
}
}
sqlConnection.Close();
}
}
return content;
}