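I am trying to identify current trends in computational biology publications. I want to query PubMed for a list of journals and retrieve every article they published in 2017. After that, I want to collect all the keywords associated with those articles.
Question: I need help modifying my code so that it retrieves all articles published in 2017, not just the most recent ones.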
# Load Entrez and Medline from Biopython
from Bio import Entrez
from Bio import Medline
Entrez.email = "my@email.com"
# List of journals I am interested in
JTList = ["Bioinformatics", "Bioinformatics and Biology Insights", "BMC Bioinformatics", "Briefings in Bioinformatics",
"Bulletin of Mathematical Biology", "Cancer Informatics",
"Computational and Structural Biotechnology Journal", "Computational Biology and Chemistry",
"Computers in Biology and Medicine", "Current Bioinformatics", "Database", "EMBnet.journal",
"Evolutionary Bioinformatics", "IEEE/ACM Transactions on Computational Biology and Bioinformatics",
"International Journal of Functional Informatics and Personalized Medicine",
"Journal of Bioinformatics and Computational Biology", "Journal of Biomedical Informatics",
"Journal of Computational Biology", "Journal of Mathematical Biology", "Journal of Theoretical Biology",
"PLoS Computational Biology", "Rapid Communications in Mass Spectrometry",
"Source Code for Biology and Medicine", "Statistical Applications in Genetics and Molecular Biology"]
# This gets only the 20 most recent articles from a single journal. I would like it to get all articles from 2017 for every journal in my list.
handle = Entrez.esearch(db="pubmed", term="Briefings in Bioinformatics[JOUR]")
record = Entrez.read(handle)
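For context on why only 20 IDs come back: ESearch returns 20 results unless `retmax` is raised, and the search term above has no date restriction. Below is a minimal sketch of one way to widen the search, assuming the `[PDAT]` (publication date) field tag is the date filter wanted here. The helper names `build_term` and `search_ids` are hypothetical, and the `retmax=10000` value assumes no journal in the list has more than 10,000 PubMed records for the year.

```python
def build_term(journal, year):
    """Build a PubMed query for one journal limited to one publication year.

    The journal name is quoted so that names containing spaces or slashes
    are treated as a single phrase.
    """
    return f'"{journal}"[JOUR] AND {year}[PDAT]'


def search_ids(journal, year, retmax=10000):
    """Return the PubMed IDs matching build_term(journal, year).

    Needs network access, and Entrez.email should already be set.
    Biopython is imported here so build_term can be used without it.
    """
    from Bio import Entrez

    handle = Entrez.esearch(db="pubmed", term=build_term(journal, year), retmax=retmax)
    record = Entrez.read(handle)
    handle.close()
    # record["Count"] holds the total number of hits, which can be compared
    # with len(record["IdList"]) to check that nothing was cut off.
    return list(record["IdList"])
```

Calling `search_ids(journal, 2017)` for each entry of `JTList` and extending `IDList` with the result would then cover every journal rather than one.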
# Now I collect all the article IDs and extract their keywords.
IDList = []
for journal in JTList:
    jour = journal + "[JOUR]"
    handle = Entrez.esearch(db="pubmed", term=jour)
    record = Entrez.read(handle)
    IDList.extend(record["IdList"])
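# Fetch the MEDLINE records for the collected IDs
handle = Entrez.efetch(db="pubmed", id=",".join(IDList), rettype="medline", retmode="text")
records = Medline.parse(handle)
keywrds = []
for record in records:
    keywrds.extend(record.get("OT", []))  # "OT" (Other Term) holds the keywords; records without it add nothing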
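Once the ID list covers a whole year across many journals, fetching everything in a single request may become unwieldy. The following is a rough sketch of fetching in batches and pooling the `OT` keywords, not a definitive implementation. The helper names `chunks`, `collect_keywords`, and `fetch_keywords` are hypothetical, and the batch size of 200 is an arbitrary assumption.

```python
def chunks(ids, size=200):
    """Yield successive slices of `ids`, each holding at most `size` items."""
    for start in range(0, len(ids), size):
        yield ids[start:start + size]


def collect_keywords(records):
    """Gather the "OT" (Other Term) entries from MEDLINE records.

    Each record is expected to behave like a dict; records without
    an "OT" field contribute nothing.
    """
    keywords = []
    for record in records:
        keywords.extend(record.get("OT", []))
    return keywords


def fetch_keywords(id_list, batch_size=200):
    """Fetch MEDLINE records in batches and return all of their keywords.

    Needs network access, and Entrez.email should already be set.
    Biopython is imported here so the helpers above work without it.
    """
    from Bio import Entrez, Medline

    keywords = []
    for batch in chunks(id_list, batch_size):
        handle = Entrez.efetch(db="pubmed", id=",".join(batch),
                               rettype="medline", retmode="text")
        keywords.extend(collect_keywords(Medline.parse(handle)))
        handle.close()
    return keywords
```

With the IDs gathered above, `fetch_keywords(IDList)` would return one flat list of keywords that can then be counted, for example with `collections.Counter`.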
Thank you for your help. I welcome any suggestions for improving this code.