Question

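I'm trying to use Python's subprocess module. I need to send input to the first process, whose output then becomes the input of the second process. The situation is basically the same as the example in the documentation here: http://docs.python.org/library/subprocess.html#replacing-shell-pipeline except that I need to supply input to the first command. Here is that example, copied: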

p1 = Popen(["dmesg"], stdout=PIPE)
p2 = Popen(["grep", "hda"], stdin=p1.stdout, stdout=PIPE)
p1.stdout.close()  # Allow p1 to receive a SIGPIPE if p2 exits.
output = p2.communicate()[0]

If we change the first line to:

p1 = Popen(["cat"], stdout=PIPE, stdin=PIPE)

How do I supply the input string to the process? If I try it by changing the last line to:

output = p2.communicate(input=inputstring)[0]

it does not work.

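I do have a working version, which simply stores the output of the first command in a string and then passes it to the second command. This isn't terrible, since there is basically no concurrency to exploit (in my actual use case, the first command exits quickly and produces all of its output at the end). Here is the full working version: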

import subprocess

simple = """Writing some text
with some lines in which the
word line occurs but others
where it does
not
"""

def run ():
  catcommand = [ "cat" ]
  catprocess = subprocess.Popen(catcommand,
                                stdin=subprocess.PIPE,
                                stdout=subprocess.PIPE,
                                stderr=subprocess.PIPE)
  (catout, caterr) = catprocess.communicate(input=simple)
  grepcommand = [ "grep", "line" ]
  grepprocess = subprocess.Popen(grepcommand,
                                stdin=subprocess.PIPE,
                                stdout=subprocess.PIPE,
                                stderr=subprocess.PIPE)
  (grepout, greperr) = grepprocess.communicate(input=catout)
  print "--- output ----"
  print grepout 
  print "--- error ----"
  print greperr 

if __name__ == "__main__":
  run()

I hope I've been clear enough. Thanks for your help.

Answer 1

If you do this

from subprocess import Popen, PIPE
p1 = Popen(["cat"], stdout=PIPE, stdin=PIPE)

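you should call p1.communicate("Your Input to the p1"), and the data will flow through the PIPE. stdin is the process's input, and it is the only thing you should communicate with.

The program you gave is perfectly fine; there doesn't seem to be any problem with it.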
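
For readers on Python 3, here is a minimal sketch of that call. It assumes a POSIX `cat` is available on PATH, and it passes bytes because pipes are opened in binary mode by default; the input text is only a placeholder.

```python
from subprocess import Popen, PIPE

# Assumption: a POSIX `cat` is available on PATH.
p1 = Popen(["cat"], stdin=PIPE, stdout=PIPE)

# communicate() writes the input, closes stdin, and reads stdout until EOF.
out, _ = p1.communicate(b"Your Input to the p1\n")
print(out.decode(), end="")
```

Passing `text=True` to `Popen` (Python 3.7+) would let you pass and receive `str` instead of bytes.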

Answer 2

I assume cat and grep are just example commands; otherwise you could use a pure-Python solution without subprocesses, e.g.:

for line in simple.splitlines():
    if "line" in line:
        print(line)

Or, if you want to use grep:

from subprocess import Popen, PIPE

output = Popen(['grep', 'line'], stdin=PIPE, stdout=PIPE).communicate(simple)[0]
print output,

You can pass the output of the first command to the second without storing it in a string first:

from subprocess import Popen, PIPE
from threading import Thread

# start commands in parallel
first = Popen(first_command, stdin=PIPE, stdout=PIPE)
second = Popen(second_command, stdin=first.stdout, stdout=PIPE)
first.stdout.close() # notify `first` if `second` exits 
first.stdout = None # avoid I/O on it in `.communicate()`

# feed input to the first command
Thread(target=first.communicate, args=[simple]).start() # avoid blocking

# get output from the second command at the same time
output = second.communicate()[0]
print output,

If you don't want to store all the input/output in memory, you may need threads (to read/write in chunks without blocking) or a select loop (which works on POSIX).
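
A rough sketch of the thread-based variant, written for Python 3 and assuming POSIX `cat` and `grep` on PATH. The helper name `feed` and the generated test input are illustrative only. One thread writes the input in chunks while the main thread consumes the output line by line, so neither side has to hold everything in memory.

```python
import threading
from subprocess import Popen, PIPE

def feed(pipe, chunks):
    """Write chunks to the pipe, then close it so the child sees EOF."""
    try:
        for chunk in chunks:
            pipe.write(chunk)
        pipe.close()
    except BrokenPipeError:
        pass  # the consumer exited before reading everything

# Assumption: POSIX `cat` and `grep` are available on PATH.
first = Popen(["cat"], stdin=PIPE, stdout=PIPE)
second = Popen(["grep", "line"], stdin=first.stdout, stdout=PIPE)
first.stdout.close()  # let `first` receive SIGPIPE if `second` exits

# Illustrative input: generated lazily, never held in memory as a whole.
chunks = (b"line %d\n" % i for i in range(100000))
writer = threading.Thread(target=feed, args=(first.stdin, chunks))
writer.start()

count = 0
for line in second.stdout:  # read the output incrementally
    count += 1
second.stdout.close()

writer.join()
first.wait()
second.wait()
print(count)
```

Writing from a separate thread matters here: if the main thread wrote all the input before reading any output, both pipes could fill up and the processes would block on each other.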

If there are multiple commands, it may be more readable to use the shell directly, as @Troels Folke suggests, or to use a library such as plumbum that hides all the gory details of emulating the shell by hand.

Answer 3

Hmm, why not mix in a little (ba)sh? :-)

from subprocess import Popen, PIPE
cproc = Popen('cat | grep line', stdin=PIPE, stdout=PIPE, stderr=PIPE, shell=True)
out, err = cproc.communicate("this line has the word line in it")

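Please note:

This only works on systems with a Bourne Shell-compatible shell (like most *nix'es).
Using shell=True and putting user input in the command string is a bad idea unless you escape the user input first. Read the subprocess docs -> "Frequently Used Arguments" for details.
This is ugly, non-portable, non-Pythonic, and so on...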
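
To illustrate the escaping caveat, here is a hedged Python 3 sketch that uses the standard library's `shlex.quote` before interpolating a value into the shell command. The `pattern` value is a made-up example of hostile input, and `grep -F` (fixed-string matching) is my choice for the demo, not part of the answer above.

```python
import shlex
from subprocess import Popen, PIPE

# Hypothetical untrusted input containing shell metacharacters.
pattern = "it's; echo pwned"

# shlex.quote() makes the value a single, literal shell word.
# `--` stops grep from treating the pattern as an option.
cmd = "cat | grep -F -- " + shlex.quote(pattern)

proc = Popen(cmd, shell=True, stdin=PIPE, stdout=PIPE)
out, _ = proc.communicate(b"it's; echo pwned\nharmless\n")
print(out.decode(), end="")
```

Without the quoting, the shell would see an unbalanced quote and a second command after the semicolon instead of a search pattern.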

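Edit: However, there is no need to use cat if all you want to do is grep. Just feed the input directly to grep or, even better, use Python regular expressions.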
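
A small sketch of the regular-expression route, reusing the sample text from the question. The pattern is the same literal word that was passed to grep there.

```python
import re

simple = """Writing some text
with some lines in which the
word line occurs but others
where it does
not
"""

pattern = re.compile(r"line")

# Keep the lines that contain a match, as `grep line` would.
matches = [line for line in simple.splitlines() if pattern.search(line)]
print("\n".join(matches))
```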

Python subprocess module: how do I give input to the first of a series of piped commands?
