我想将一个文件夹中的多个文本文件(每个包含两列的文件)导入到一个Excel工作表中,以便每个新文件都在一个新列中启动。理想情况下,我需要第一个文件中的两列,以及每个附加文本文件中的第二列。 在powerquery中,我尝试使用"从文件夹导入(导入元数据和文件夹中文件的链接)"功能后跟查询编辑器并扩展二进制文件,结果是每个新文件都附加在前一个文件的末尾。但我希望每个文件都在同一张表格中开始一个新列,我不知道该怎么做。
我如何指导powerquery这样做? 在此先感谢您的帮助!
答案 0 :(得分:1)
我的提议包括通过高级编辑器添加的2个相当困难的步骤,但它对于文件夹中的.txt文件数量是动态的。我添加了大量的评论,所以它应该是自我解释的。
/* In this query, .txt files from a folder are combined.
Each source file has 2 columns.
The resulting table consists of both columns from the first file and each second column from the other files.
Tables are joined using each first column as key and with a left outer join
It is assumed that each file has column headers in the first row, that the first column header is the same for each file
and, preferably, the second column header differs per file, although this is not necessary.
This query is tested with the following file contents:
File1.txt:
ID,File1
1,A
2,B
3,C
4,D
File2.txt:
ID,File2
1,W
2,X
3,Y
Another file was added later on, to test for .txt files being added to the folder: works fine!
*/
let
// Standard UI:
Source = Folder.Files("C:\Users\Marcel\Documents\Forum bijdragen\StackOverflow Power Query\Multiple files in 1 folder"),
// Standard UI; step renamed
FilteredTxt = Table.SelectRows(Source, each [Extension] = ".txt"),
// Standard UI; step renamed
RemovedColumns = Table.RemoveColumns(FilteredTxt,{"Name", "Extension", "Date accessed", "Date modified", "Date created", "Attributes", "Folder Path"}),
// UI add custom column "FileContents" with formula Csv.Document([Content]); step renamed
AddedFileContents = Table.AddColumn(RemovedColumns, "FileContents", each Csv.Document([Content])),
// Standard UI; step renamed
RemovedBinaryContent = Table.RemoveColumns(AddedFileContents,{"Content"}),
// In the next 3 steps, temporary names for the new columns are created ("Column2", "Column3", etcetera)
// Standard UI: add custom Index column, start at 2, increment 1
#"Added Index" = Table.AddIndexColumn(RemovedBinaryContent, "Index", 2, 1),
// Standard UI: select Index column, Transform tab, Format, Add Prefix: "Column"
#"Added Prefix" = Table.TransformColumns(#"Added Index", {{"Index", each "Column" & Text.From(_, "en-US"), type text}}),
// Standard UI:
#"Renamed Columns" = Table.RenameColumns(#"Added Prefix",{{"Index", "ColumnName"}}),
// Now we have the names for the new columns
// Advanced Editor: create a list with records with FileContents (tables) and ColumnNames (text) (1 list item (or record) per txt file in the folder)
// From this list, the resulting table will be build in the next step.
ListOfRecords = Table.ToRecords(#"Renamed Columns"),
// Advanced Editor: use List.Accumulate to build the table with all columns,
// starting with Column1 of the first file (Table.FromList(ListOfRecords{0}[FileContents][Column1], each {_}),)
// adding Column2 of each file for all items in ListOfRecords.
BuildTable = List.Accumulate(ListOfRecords,
Table.FromList(ListOfRecords{0}[FileContents][Column1], each {_}),
(TableSoFar,NewColumn) =>
Table.ExpandTableColumn(Table.NestedJoin(TableSoFar, "Column1", NewColumn[FileContents], "Column1", "Dummy", JoinKind.LeftOuter), "Dummy", {"Column2"}, {NewColumn[ColumnName]})),
// Standard UI
#"Promoted Headers" = Table.PromoteHeaders(BuildTable)
in
#"Promoted Headers"