Uploading large files to OneDrive with the SDK

Time: 2019-05-23 07:52:30

Tags: python onedrive

I want to upload a huge file (larger than 2GB) to OneDrive.

I tried the code from the SDK page (https://github.com/OneDrive/onedrive-sdk-python):

returned_item = client.item(drive='me', path=backupPath).children['photos.tgz'].upload_async('/Users/koot/photos.tgz')

Although this code works for smaller files, uploading a large file fails with:

BrokenPipeError: [Errno 32] Broken pipe

requests.exceptions.ConnectionError: ('Connection aborted.', BrokenPipeError(32, 'Broken pipe'))

1 answer:

Answer 0: (score: 0)

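The SDK is now deprecated. You can use Microsoft Graph and the OneDrive API to upload files to OneDrive instead. The OneDrive API supports simple upload for small files (<4MB) and resumable upload for large files, where you create an upload session and then upload the file one chunk at a time. See https://docs.microsoft.com/en-us/onedrive/developer/rest-api/api/driveitem_createuploadsession?view=odsp-graph-online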
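To make the "one chunk at a time" idea concrete, here is a minimal sketch of how the byte ranges for a resumable upload could be computed. The function name content_ranges is hypothetical (it is not part of any SDK), and the 320 KiB default simply mirrors the chunk size used in the code further down; no network calls are made.

```python
CHUNK_UNIT = 327680  # 320 KiB; upload session chunk sizes are multiples of this


def content_ranges(total_size, chunk_size=CHUNK_UNIT):
    """Yield (start, end, header_value) for each chunk of a file.

    Hypothetical helper for illustration: `end` is inclusive, as the
    Content-Range header expects, and the last chunk may be shorter.
    """
    if chunk_size <= 0 or chunk_size % CHUNK_UNIT != 0:
        raise ValueError("chunk_size must be a positive multiple of 320 KiB")
    start = 0
    while start < total_size:
        end = min(start + chunk_size, total_size) - 1
        yield start, end, "bytes {}-{}/{}".format(start, end, total_size)
        start = end + 1


if __name__ == "__main__":
    # Print the Content-Range values for a 700,000-byte file
    for _, _, header_value in content_ranges(700000):
        print(header_value)
```

Each yielded header value would go into the Content-Range header of one PUT request against the session's uploadUrl.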

Github repo

Here is a tutorial I wrote on solving this problem: https://dev.to/jsnmtr/automating-files-upload-to-microsoft-onedrive-unexpected-challenges-and-a-success-story-2ini

Here is the code for uploading a large file:

import os
import requests

# Assumes onedrive_destination, file_name, file_path and headers (carrying the
# Authorization token) are already defined, as in the tutorial linked above.

# Create an upload session
upload_session = requests.post(onedrive_destination + "/" + file_name + ":/createUploadSession", headers=headers).json()

total_file_size = os.path.getsize(file_path)
chunk_size = 327680  # 320 KiB; chunk sizes must be a multiple of 320 KiB

with open(file_path, 'rb') as f:
    start_index = 0
    while True:
        chunk_data = f.read(chunk_size)
        # If end of file, break
        if not chunk_data:
            break
        # The last chunk may be shorter than chunk_size, so use its real length
        end_index = start_index + len(chunk_data) - 1
        # Set the headers describing where this chunk sits in the file
        chunk_headers = {
            'Content-Length': str(len(chunk_data)),
            'Content-Range': 'bytes {}-{}/{}'.format(start_index, end_index, total_file_size),
        }
        # Upload one chunk at a time
        chunk_data_upload = requests.put(upload_session['uploadUrl'], data=chunk_data, headers=chunk_headers)
        print(chunk_data_upload)
        print(chunk_data_upload.json())
        start_index = end_index + 1
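
Because the upload is resumable, a dropped connection (like the BrokenPipeError in the question) does not have to mean starting over. As a rough sketch: the upload session reports which bytes it still expects in a nextExpectedRanges list such as ["327680-"], and a GET on the uploadUrl should return the current session status. The helper name next_start_offset below is hypothetical and only parses that list; it performs no requests itself.

```python
def next_start_offset(next_expected_ranges):
    """Return the first byte offset the server still expects, or None.

    Hypothetical helper: `next_expected_ranges` is the list found under
    the "nextExpectedRanges" key of an upload session response, with
    entries like "327680-" or "12345-55232".
    """
    if not next_expected_ranges:
        return None
    first_range = next_expected_ranges[0]
    return int(first_range.split("-")[0])


if __name__ == "__main__":
    # Example session status as it might look after one 320 KiB chunk
    status = {"nextExpectedRanges": ["327680-"]}
    print(next_start_offset(status["nextExpectedRanges"]))
```

After an error, one option would be to fetch the session status with requests.get(upload_session['uploadUrl']).json(), pass its nextExpectedRanges to this helper, then f.seek() to the returned offset and continue the loop from there.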