Question

I have a lot of web chat transcripts that look like this:

 Agent: Click here. \n Then click here. \n Then do this. \r\n

 Customer: Show me. \r\n

 Agent: Ok. \n One Second. \r\n

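When I import these into R, I want only \r\n to be treated as a new line, not \n. I don't see an option for this in readLines(). Is there a best way to do this?

Here is a solution for Python on Stack Overflow: How to split only on carriage returns with readlines in python?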

Answer 1

I looked into whether this could be done with readLines(), but I don't see a simple way to do it.

It seems the base function readLines() is of no use here.
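A small sketch of why readLines() falls short, assuming the \n and \r\n in the transcripts are real control characters rather than literal backslash sequences. The temporary file and its contents are hypothetical stand-ins for a real chat log.

```r
# Hypothetical sample written in binary mode so the line endings are kept as-is
path <- tempfile(fileext = ".txt")
sample_text <- paste0(
  "Agent: Click here. \n Then click here. \n Then do this. \r\n",
  "Customer: Show me. \r\n",
  "Agent: Ok. \n One Second. \r\n"
)
writeBin(charToRaw(sample_text), path)

# readLines() treats LF and CRLF alike as line terminators,
# so the three chat turns come back as six separate lines
lines <- readLines(path)
length(lines)
```

Each lone \n ends a line just as \r\n does, so the turn boundaries are lost.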

A workaround is to read the text with scan() and then parse the result.

For example, if your text is in foo.txt:

readline()

Then you can strip the whitespace to get something similar to what you want.

I wish I had a more elegant solution.
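The read-then-parse idea can be sketched as follows. This is not the answer's original code: it swaps scan() for readChar(), which reads the raw file contents as one string, and then splits on \r\n only. The helper name read_crlf_records and the temporary sample file are hypothetical, and the sketch assumes the \n and \r\n in the file are real control characters.

```r
# Hypothetical helper: split a file into records on CRLF only
read_crlf_records <- function(path) {
  # Read the whole file as a single string, with no newline translation
  n_bytes <- file.info(path)$size
  text <- readChar(path, nchars = n_bytes, useBytes = TRUE)

  # Split on the exact two-character sequence \r\n
  records <- strsplit(text, "\r\n", fixed = TRUE)[[1]]

  # Collapse any remaining lone \n (and the spaces around it) into one space
  records <- gsub("\\s*\n\\s*", " ", records)

  # Strip leading and trailing whitespace from each record
  trimws(records)
}

# Hypothetical sample file standing in for foo.txt
path <- tempfile(fileext = ".txt")
sample_text <- paste0(
  "Agent: Click here. \n Then click here. \n Then do this. \r\n",
  "Customer: Show me. \r\n",
  "Agent: Ok. \n One Second. \r\n"
)
writeBin(charToRaw(sample_text), path)

records <- read_crlf_records(path)
```

With this sample, records holds one element per chat turn. To keep the inner \n characters instead of collapsing them, drop the gsub() step.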
