检查文本文件的内容,如果不存在则添加内容

时间:2019-03-29 13:09:54

标签: python python-3.x string string-comparison

我是python的新手。我正在使用python设计报价应用程序。我正在使用BeautifulSoup从聪明的报价网站上获得当天的报价。我会将其附加到文本文件中。在这里,如果已经添加了当天的报价,那么当我再次执行该程序时,应该跳过它。如何使其成为可能

代码如下:

from bs4 import BeautifulSoup
import socket
import requests
import subprocess
import datetime
def quotenotify():
    timestamp = datetime.datetime.now().strftime("%b %d")
    res = requests.get('https://www.brainyquote.com/quote_of_the_day')
    soup = BeautifulSoup(res.text, 'lxml')

    image_quote = soup.find('img', {'class': 'p-qotd bqPhotoDefault bqPhotoDefaultFw img-responsive delayedPhotoLoad'})
    quoteday=image_quote['alt']
    text_file = open("quotes.log", "a+")
    text_file.write("%s"%timestamp+"\t"+"%s"% quoteday)
    text_file.write("\n")
    text_file.close()
    return
quotenotify()

在文件中输出:

Mar 29  Where there is a great love, there are always wishes. - Willa Cather
Mar 29  Where there is great love, there are always wishes. - Willa Cather

2 个答案:

答案 0 :(得分:1)

从评论继续:

from bs4 import BeautifulSoup
import requests
import datetime

def quotenotify():
    timestamp = datetime.datetime.now().strftime("%b %d")
    res = requests.get('https://www.brainyquote.com/quote_of_the_day')
    soup = BeautifulSoup(res.text, 'lxml')
    image_quote = soup.find('img', {'class': 'p-qotd bqPhotoDefault bqPhotoDefaultFw img-responsive delayedPhotoLoad'})['alt']
    with open("quotes.log", "w+") as f:
        if image_quote not in f.read():
            f.write("%s"%timestamp+"\t"+"%s"% image_quote + "\n")

quotenotify()

编辑

由于使用模式w+会截断文件,因此建议使用pathlib:

from bs4 import BeautifulSoup
import requests
import datetime
from pathlib import Path

def quotenotify():
    timestamp = datetime.datetime.now().strftime("%b %d")
    res = requests.get('https://www.brainyquote.com/quote_of_the_day')
    soup = BeautifulSoup(res.text, 'lxml')
    image_quote = timestamp + "\t" + soup.find('img', {'class': 'p-qotd bqPhotoDefault bqPhotoDefaultFw img-responsive delayedPhotoLoad'})['alt']
    with open("quotes3.log", "a+") as f:
        contents = [Path("quotes3.log").read_text()]
        print(contents)
        print(image_quote)
        if image_quote not in contents:
            f.write("%s" % timestamp + "\t" + "%s" % image_quote + "\n")

quotenotify()

答案 1 :(得分:0)

如@DirtyBit所述,您应该首先以读取模式打开文件,然后将内容加载到变量中。

您可以在下面的示例中看到,我将内容加载到变量上,然后仅在变量不在文本文件中时才将其追加到文件中。

$doc = New-Object System.Xml.XmlDocument
$doc.LoadXml((Invoke-WebRequest -Uri $url).Content)