我有一个文件,其中的单词用“ |”定界。在这里,我需要搜索日期“ 20180603”。但是,我无法将值硬编码为搜索。日期格式是固定的YYYYDDMM,日期可以是任何东西。 我需要将此处显示的日期转换为今天的日期(系统日期)。
我正在粘贴外部文件的外观(仅在相关值周围添加了星号以强调):
00000548|WILLIAM|HUBER|WH5718||N|**20180306**|SVP-TECHNICAL FIELD SERVICES|06|329000.00 |0.00 |0.00 |205440.00 |0.00 |0.00 |0.00 |0.00 |0.00 |55000.00 |0.00 |0.00 |0.00 |1600.00 |0.00 |0.00 |0.00 |0.00 |225502.08 |0.00 |0.00 |0.00 |27629.91 |36717.17 |0.00 |33.000 |0.000 |F
00000828|NORBERTA|NOGUERA|NN1413||N|**20180306**|VP-SPECIAL PROJECTS|05|213000.00 |0.00 |88464.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |86502.87 |0.00 |0.00 |0.00 |16811.41 |15023.40 |0.00 |33.000 |0.000 |F
00001308|ERROL|PHIPPS|EP4499||N|00000548|WILLIAM|HUBER|WH5718||N|20180306|SVP-TECHNICAL FIELD SERVICES|06|329000.00 |0.00 |0.00 |205440.00 |0.00 |0.00 |0.00 |0.00 |0.00 |55000.00 |0.00 |0.00 |0.00 |1600.00 |0.00 |0.00 |0.00 |0.00 |225502.08 |0.00 |0.00 |0.00 |27629.91 |36717.17 |0.00 |33.000 |0.000 |F
00000828|NORBERTA|NOGUERA|NN1413||N|**20180306**|VP-SPECIAL PROJECTS|05|213000.00 |0.00 |88464.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |0.00 |86502.87 |0.00 |0.00 |0.00 |16811.41 |15023.40 |0.00 |33.000 |0.000 |F
00001308|ERROL|PHIPPS|EP4499||N|**20180306**|VP-LEGAL BUSINESS HEAD|05|241000.00 |0.00 |94365.00 |0.00 |0.00 ||VP-LEGAL BUSINESS HEAD|05|241000.00 |0.00 |94365.00 |0.00 |0.00 |
我尝试了很多问题,但是没有运气。
下面是我尝试过的代码;
public class ReadFile {
public static void main(String[] args) throws IOException {
File f1= new File("C:/Users/kumar.sushobhan/Desktop/ESPYTR_Big_file_EXEC.dat");
//File f1= new File("C:/Users/kumar.sushobhan/Desktop/h.txt");
String words[]= null;
FileReader fr= new FileReader(f1);
BufferedReader br= new BufferedReader(fr);
String s;
int c = 0;
String regex= "\\d{4}\\d{2}\\d{2}";
while((s= br.readLine())!=null)
{
words= s.split("|");
for(String word: words)
{
//System.out.println(word);
if(word.equals(regex))
{
c++;
}
}
}
System.out.println(c);
fr.close();
}
}
我希望可以读取快照中存在的日期并将其更改为当前系统日期。
答案 0 :(得分:1)
这是一种基本算法,它将在管道定界文件中查找,用当前日期替换“看起来像”日期的值,然后将所有内容写回到新文件中。它使用您在问题中描述的YYYYDDMM
格式,但可能应该为YYYYMMDD
,并且我已经指出您需要在哪里进行更改。这与日期验证和错误处理相得益彰,试图将其保持得相对较短,但我为评论所有尝试过分了:
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
public class DateReplacer
{
private static final Pattern DATE_MATCHER =
Pattern.compile("(?:(?:19|20)[0-9]{2})([0-9]{2})([0-9]{2})");
public static void main(String... args)
throws Exception
{
// These are the paths to our input and output files
Path input = Paths.get("input.dat");
Path output = Paths.get("output.dat");
// We need to get today's date in YYYYDDMM format, so we create a
// DateFormatter for that. If it turns out that your date format is
// actually YYYYMMDD, you can just use DateFormatter.BASIC_ISO_DATE
// instead.
DateTimeFormatter formatter = DateTimeFormatter.ofPattern("yyyyddMM");
String todaysDate = LocalDate.now().format(formatter);
// Use try-with-resources to create a reader & writer
try (BufferedReader reader = Files.newBufferedReader(input);
BufferedWriter writer = Files.newBufferedWriter(output)) {
String line;
// Read lines until there are no more lines
while ((line = reader.readLine()) != null) {
// Split them on the | character, notice that it needs to be
// escaped because it is a regex metacharacter
String[] columns = line.split("\\|");
// Iterate over every column...
for (int i = 0; i < columns.length; i++) {
// ... and if the value looks like a date ...
if (isDateLike(columns[i])) {
// ... overwrite with today's date.
columns[i] = todaysDate;
}
}
// Re-join the columns with the | character and write it out
writer.write(String.join("|", columns));
writer.newLine();
}
}
}
private static boolean isDateLike(String str)
{
// Avoid the regular expression if we can
if (str.length() != 8) {
return false;
}
Matcher matcher = DATE_MATCHER.matcher(str);
if (matcher.matches()) {
// If it turns out that your date format is actually YYYYMMDD
// you will need to swap these two lines.
int day = Integer.parseInt(matcher.group(1), 10);
int month = Integer.parseInt(matcher.group(2), 10);
// We don't need to validate year because we already know
// it is between 1900 and 2099 inclusive
return day >= 1 && day <= 31 && month >= 1 && month <= 12;
}
return false;
}
}
此示例使用a try-with-resources
statement来确保正确关闭输入和输出文件。
答案 1 :(得分:0)
您可以使用如下正则表达式。
String regex = "(19|20)[0-9][0-9](0[1-9]|1[0-2])(0[1-9]|1[0-9]|2[0-9]|30|31)";
它并不完美,但可以匹配大多数日期。例如,它将消除月份超过12的日期。此外,它将适用于直到2099年的日期。它不会处理类似于6月有30天的日期规则。它可以匹配日期在1到31之间的任何日期。
您不能使用equals
作为日期。您将必须使用Pattern.matches(regex, string)
答案 2 :(得分:-2)
使用String中的contains方法。
s.contains(“ 20180603”)