使用zappa部署到AWS Lambda时,spaCy会抛出OSError

时间:2018-03-09 05:02:44

标签: python aws-lambda spacy zappa

将Python spaCy应用程序部署到AWS Lambda时,我在部署中遇到以下错误(请参阅下文)。为什么要使用zappa进行部署? zip文件压缩为125MB,因此从aws-cli直接上传到空间失败,并且转移到S3也会失败,因为未压缩的文件超过250MB。

我的程序本身没有进行任何多线程处理,也没有多处理,它只使用spaCy 2.0。我在EC2 AWS Linux t2.medium上构建和部署。从spaCy AWS Lambda函数获得往返答案的具体步骤是什么?

下面的失败追踪:

[1520570028387] Failed to find library...right filename?
[1520570029826] [Errno 38] Function not implemented: OSError
Traceback (most recent call last):
  File "/var/task/handler.py", line 509, in lambda_handler
  return LambdaHandler.lambda_handler(event, context)
  File "/var/task/handler.py", line 237, in lambda_handler
  handler = cls()
  File "/var/task/handler.py", line 129, in __init__
  self.app_module = importlib.import_module(self.settings.APP_MODULE)
  File "/var/lang/lib/python3.6/importlib/__init__.py", line 126, in import_module
  return _bootstrap._gcd_import(name[level:], package, level)
  File "<frozen importlib._bootstrap>", line 978, in _gcd_import
  File "<frozen importlib._bootstrap>", line 961, in _find_and_load
  File "<frozen importlib._bootstrap>", line 950, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 655, in _load_unlocked
  File "<frozen importlib._bootstrap_external>", line 678, in exec_module
  File "<frozen importlib._bootstrap>", line 205, in _call_with_frames_removed
  File "/tmp/spaciness/front.py", line 1, in <module>
  import spacy
  File "/tmp/spaciness/spacy/__init__.py", line 4, in <module>
  from .cli.info import info as cli_info
  File "/tmp/spaciness/spacy/cli/__init__.py", line 1, in <module>
  from .download import download
  File "/tmp/spaciness/spacy/cli/download.py", line 10, in <module>
  from .link import link
  File "/tmp/spaciness/spacy/cli/link.py", line 7, in <module>
  from ..compat import symlink_to, path2str
  File "/tmp/spaciness/spacy/compat.py", line 11, in <module>
  from thinc.neural.util import copy_array
  File "/tmp/spaciness/thinc/neural/__init__.py", line 1, in <module>
  from ._classes.model import Model
  File "/tmp/spaciness/thinc/neural/_classes/model.py", line 12, in <module>
  from ..train import Trainer
  File "/tmp/spaciness/thinc/neural/train.py", line 7, in <module>
  from tqdm import tqdm
  File "/tmp/spaciness/tqdm/__init__.py", line 1, in <module>
  from ._tqdm import tqdm
  File "/tmp/spaciness/tqdm/_tqdm.py", line 53, in <module>
  mp_lock = mp.Lock()  # multiprocessing lock
  File "/var/lang/lib/python3.6/multiprocessing/context.py", line 67, in Lock
  return Lock(ctx=self.get_context())
  File "/var/lang/lib/python3.6/multiprocessing/synchronize.py", line 163, in __init__
  SemLock.__init__(self, SEMAPHORE, 1, 1, ctx=ctx)
  File "/var/lang/lib/python3.6/multiprocessing/synchronize.py", line 60, in __init__
  unlink_now)
OSError: [Errno 38] Function not implemented

1 个答案:

答案 0 :(得分:1)

我可以通过以下步骤解决该问题:

  1. 增加zappa_settings.json中的lambda函数的内存大小:

    {     “ dev”:{

        "memory_size": 3008,
    }
    

    }

  2. 我必须使用更新版本的tqdm。默认情况下,版本为4.19,存在以下问题,如下所述:https://github.com/tqdm/tqdm/issues/466

所描述的问题已在较新的版本中修复。只需将tqdm添加到我的requirements.txt并对该软件包执行pip升级:

pip install -U tqdm

执行zappa deploy dev时,我得到以下消息:

  

(tqdm 4.32.1(/var/task/ve/lib/python3.6/site-packages)、Requirement.parse('tqdm==4.19.1')、{'zappa'})

tqdm 4.19.1是zappa的默认版本,而tqdm 4.32.1是包含该修补程序的新版本。