Question

I am using this JavaScript code as an Action in Acrobat Pro:

// Iterates over all pages, looks for a given string, and extracts all
// pages on which that string is found to a new file.

var pageArray = [];

var stringToSearchFor = "USA";

for (var p = 0; p < this.numPages; p++) {
	// iterate over all words
	for (var n = 0; n < this.getPageNumWords(p); n++) {
		if (this.getPageNthWord(p, n) == stringToSearchFor) {
			pageArray.push(p);
			break;
		}
	}
}

if (pageArray.length > 0) {
	// extract all pages that contain the string into a new document
	var d = app.newDoc();    // this will add a blank page - we need to remove that once we are done
	for (var n = 0; n < pageArray.length; n++) {
		d.insertPages( {
			nPage: d.numPages-1,
			cPath: this.path,
			nStart: pageArray[n],
			nEnd: pageArray[n],
		} );
	}

	// remove the blank first page that app.newDoc() added
	d.deletePages(0);
}

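When the stringToSearchFor variable is set to "USA", the script performs as expected.

But it fails when that string is changed to the actual text I want to match, "U.S.A.", with a period after each initial.

I have tried escaping the . characters, replacing them with the wildcard *, and changing the string to a RegEx pattern, all to no avail.

When the text in both the PDF and the variable is USA, the action does what it should.

When both the PDF and the variable are U.S.A., the script runs but no pages are extracted.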

How can I get this script to work when the string contains periods?
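
One possible direction, offered as an untested sketch rather than a confirmed fix: as far as I can tell, getPageNthWord takes an optional third argument, bStrip, which defaults to true and removes punctuation and white space from the returned word. If that is what happens here, the periods never reach the == comparison, and "U.S.A." may also be split into several words, so no single word can equal it. The sketch below asks for the unstripped words, joins them per page, and searches the joined text. The helper name pagesContaining is hypothetical, and how Acrobat actually tokenizes "U.S.A." is an assumption.

```javascript
// Sketch only: returns the 0-based page numbers whose text contains `needle`.
// `doc` is assumed to behave like an Acrobat Doc object (numPages,
// getPageNumWords, getPageNthWord). pagesContaining is a hypothetical helper,
// not part of the Acrobat API.
function pagesContaining(doc, needle) {
	var pages = [];
	// Compare with white space removed, so the way words are split does not matter.
	var target = needle.replace(/\s+/g, "");
	for (var p = 0; p < doc.numPages; p++) {
		var count = doc.getPageNumWords(p);
		var text = "";
		for (var n = 0; n < count; n++) {
			// Third argument (bStrip) set to false: keep punctuation on each word.
			text += doc.getPageNthWord(p, n, false);
		}
		if (text.replace(/\s+/g, "").indexOf(target) !== -1) {
			pages.push(p);
		}
	}
	return pages;
}
```

In the Action, this would replace the first loop with something like var pageArray = pagesContaining(this, "U.S.A."); and leave the extraction part unchanged. Because white space is dropped before searching, the match can span word boundaries, which may produce false positives for short strings.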

0 answers: