我有一个应用可让人们上传文件,表示为UploadedFiles
。但是,我想确保用户只上传xml文件。我知道我可以使用magic
执行此操作,但我不知道在哪里进行此检查 - 我无法将其放在clean
函数中,因为{{1时文件尚未上传据我所知,我跑了。
以下是clean
模型:
UploadedFile
答案 0 :(得分:21)
验证文件是一个常见的挑战,所以我想使用验证器:
import magic
from django.utils.deconstruct import deconstructible
from django.template.defaultfilters import filesizeformat
@deconstructible
class FileValidator(object):
error_messages = {
'max_size': ("Ensure this file size is not greater than %(max_size)s."
" Your file size is %(size)s."),
'min_size': ("Ensure this file size is not less than %(min_size)s. "
"Your file size is %(size)s."),
'content_type': "Files of type %(content_type)s are not supported.",
}
def __init__(self, max_size=None, min_size=None, content_types=()):
self.max_size = max_size
self.min_size = min_size
self.content_types = content_types
def __call__(self, data):
if self.max_size is not None and data.size > self.max_size:
params = {
'max_size': filesizeformat(self.max_size),
'size': filesizeformat(data.size),
}
raise ValidationError(self.error_messages['max_size'],
'max_size', params)
if self.min_size is not None and data.size < self.min_size:
params = {
'min_size': filesizeformat(self.mix_size),
'size': filesizeformat(data.size)
}
raise ValidationError(self.error_messages['min_size'],
'min_size', params)
if self.content_types:
content_type = magic.from_buffer(data.read(), mime=True)
data.seek(0)
if content_type not in self.content_types:
params = { 'content_type': content_type }
raise ValidationError(self.error_messages['content_type'],
'content_type', params)
def __eq__(self, other):
return (
isinstance(other, FileValidator) and
self.max_size == other.max_size and
self.min_size == other.min_size and
self.content_types == other.content_types
)
然后,您可以在FileValidator
或model.FileField
中使用forms.FileField
,如下所示:
validate_file = FileValidator(max_size=1024 * 100,
content_types=('application/xml',))
file = models.FileField(upload_to=settings.XML_ROOT,
validators=[validate_file])
答案 1 :(得分:14)
对于后代:解决方案是使用read
方法并将其传递给magic.from_buffer
。
class UploadedFileForm(ModelForm):
def clean_file(self):
file = self.cleaned_data.get("file", False)
filetype = magic.from_buffer(file.read())
if not "XML" in filetype:
raise ValidationError("File is not XML.")
return file
class Meta:
model = models.UploadedFile
exclude = ('project',)
答案 2 :(得分:11)
从django 1.11开始,您还可以使用FileExtensionValidator。
from django.core.validators import FileExtensionValidator
class UploadedFile(models.Model):
file = models.FileField(upload_to=settings.XML_ROOT,
validators=[FileExtensionValidator(allowed_extensions=['xml'])])
请注意,这必须在FileField上使用,并且不能在CharField上工作(例如),因为验证器在value.name上验证。
参考:https://docs.djangoproject.com/en/dev/ref/validators/#fileextensionvalidator
答案 3 :(得分:4)
我认为你想要做的是用Django的Form.clean_your_field_name_here()
方法清理上传的文件 - 如果它是作为普通的HTTP POST请求提交的话,那么系统上的数据是可用的。
此外,如果您认为这种效率低下探索不同Django文件上传后端的选项以及如何进行流处理。
如果在处理上传时需要考虑系统的安全性
确保上传的文件具有正确的扩展名
确保mimetype与文件扩展名
如果您担心用户上传漏洞利用文件(用于攻击您的网站)
在保存时重写所有文件内容以消除可能的额外(漏洞利用)有效负载(因此您无法在浏览器在下载时将HTML解释为站点源HTML文件的XML中嵌入HTML)
确保您在下载时使用内容处置标题
此处提供更多信息:http://opensourcehacker.com/2013/07/31/secure-user-uploads-and-exploiting-served-user-content/
以下是我如何清理上传图片的示例:
class Example(models.Model):
image = models.ImageField(upload_to=filename_gen("participant-images/"), blank=True, null=True)
class Example(forms.ModelForm):
def clean_image(self):
""" Clean the uploaded image attachemnt.
"""
image = self.cleaned_data.get('image', False)
utils.ensure_safe_user_image(image)
return image
def ensure_safe_user_image(image):
""" Perform various checks to sanitize user uploaded image data.
Checks that image was valid header, then
:param: InMemoryUploadedFile instance (Django form field value)
:raise: ValidationError in the case the image content has issues
"""
if not image:
return
assert isinstance(image, InMemoryUploadedFile), "Image rewrite has been only tested on in-memory upload backend"
# Make sure the image is not too big, so that PIL trashes the server
if image:
if image._size > 4*1024*1024:
raise ValidationError("Image file too large - the limit is 4 megabytes")
# Then do header peak what the image claims
image.file.seek(0)
mime = magic.from_buffer(image.file.getvalue(), mime=True)
if mime not in ("image/png", "image/jpeg"):
raise ValidationError("Image is not valid. Please upload a JPEG or PNG image.")
doc_type = mime.split("/")[-1].upper()
# Read data from cStringIO instance
image.file.seek(0)
pil_image = Image.open(image.file)
# Rewrite the image contents in the memory
# (bails out with exception on bad data)
buf = StringIO()
pil_image.thumbnail((2048, 2048), Image.ANTIALIAS)
pil_image.save(buf, doc_type)
image.file = buf
# Make sure the image has valid extension (can't upload .htm image)
extension = unicode(doc_type.lower())
if not image.name.endswith(u".%s" % extension):
image.name = image.name + u"." + extension
答案 4 :(得分:1)
我找到了一个有趣的程序包,最近可以执行上传文件验证。您可以看到the package here。打包方法与苏丹回答类似,因此我们可以立即实施。
{
"query":{
"bool":{
"should":{
"match":{
"userid":"9fe3ba41-780f-448d-8c99-c0440a7ba3f0"
}
}
}
}
}