我可以在mapreduce作业中为我的映射器提供自定义输入吗?

时间:2014-05-24 06:15:51

标签: hadoop mapreduce bigdata logfile-analysis

如果你想用mapreduce处理文本文件或日志文件,默认情况下mapper输入值ll b一行到新行字符但是我想处理一个日志文件

A (18:18:43,703 [ERROR] AccStmtGeneratorImpl - Statement not generated for::304005)

B (11:40:04,202 [ERROR] JobStoreTX - Failed to close Connection
 java.sql.SQLException: Already closed.
    at org.apache.commons.dbcp.PoolableConnection.close(PoolableConnection.java:114)
    at `enter code here`org.apache.commons.dbcp.PoolingDataSource$PoolGuardConnectionWrapper.close(PoolingDataSource.java:191)
    at org.quartz.impl.jdbcjobstore.JobStoreSupport.closeConnection(JobStoreSupport.java:3579)
    at org.quartz.impl.jdbcjobstore.JobStoreSupport.cleanupConnection(JobStoreSupport.java:3555)
    at org.quartz.impl.jdbcjobstore.JobStoreSupport.executeInNonManagedTXLock(JobStoreSupport.java:3783)
    at org.quartz.impl.jdbcjobstore.JobStoreSupport.acquireNextTrigger(JobStoreSupport.java:2728)
    at org.quartz.core.QuartzSchedulerThread.run(QuartzSchedulerThread.java:264)
)

这里我的mapper应该从11:40:04,202取值到新的时间戳意味着A' ll是一个值而整个B ll b另一个值 那怎么能这样做..? 提前致谢 请帮忙......

0 个答案:

没有答案