使用boto3时S3连接超时

时间:2016-12-21 12:55:42

标签: python amazon-web-services amazon-s3 boto3

我正在使用boto3来操作S3。如果我的应用程序由于网络问题而无法访问S3,则连接将挂起,直到最终超时。我想设置较低的连接超时。我遇到了this PR for botocore,允许设置超时:

$ sudo iptables -A OUTPUT -p tcp --dport 443 -j DROP

from botocore.client import Config
import boto3

config = Config(connect_timeout=5, read_timeout=5)

s3 = boto3.client('s3', config=config)

s3.head_bucket(Bucket='my-s3-bucket') 

这会抛出一个ConnectTimeout,但错误输出仍然需要很长时间:

ConnectTimeout: HTTPSConnectionPool(host='my-s3-bucket.s3.amazonaws.com', port=443): Max retries exceeded with url: / (Caused by ConnectTimeoutError(<botocore.awsrequest.AWSHTTPSConnection object at 0x2ad5dd0>, 'Connection to my-s3-bucket.s3.amazonaws.com timed out. (connect timeout=5)'))

调整连接和读取超时并不会影响连接响应的速度。

2 个答案:

答案 0 :(得分:10)

你可能会被boto3多次重试连接的默认行为所困扰,并且在两者之间以指数方式退出。我的结果很好:

config = Config(connect_timeout=5, retries={'max_attempts': 0})

答案 1 :(得分:0)

你有没有解决这个问题?我怀疑你需要你的boto连接的凭证。

以下是我连接到boto3的方式:

import boto3
from botocore.exceptions import ClientError
import re
from io import BytesIO
import gzip
import datetime
import dateutil.parser as dparser
from datetime import datetime
import tarfile
import requests
import sys
from awsglue.transforms import *
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.job import Job

## Needed glue stuff
sc = SparkContext()
glueContext = GlueContext(sc)
spark = glueContext.spark_session
job = Job(glueContext)

## 
## currently this will run for everything that is in the staging directory         of omniture

# set needed parms
myProfileName = 'MyDataLake'
dhiBucket = 'data-lake'
#create boto3 session
try:    
    session = boto3.Session(aws_access_key_id='aaaaaaaaaaaa', aws_secret_access_key='abcdefghijklmnopqrstuvwxyz', region_name='us-east-    1')aws_session_token=None, region_name=None, botocore_session=None
s3 = session.resource('s3') #establish connection to s3
except Exception as conne:
    print ("Unable to connect:  " + str(conne))
    errtxt = requests.post("https://errorcapturesite", data=    {'message':'Unable to connect to : ' + myProfileName,     'notify':True,'color':'red'})
    print(errtxt.text) 
    exit()