使用sed或awk从文件中删除多字节字符。

时间:2016-07-17 21:10:29

标签: unix awk sed

我需要在常规文件中复制所有多字节。

示例数据

  

1792834104797印象点击流页面FPTI
  1468294723126 257678 10692349
  5c7ee584-6f83-4062-b324-2219922e8848 mobile:msdk:preconnect ::   手机:MSDK:PRECONNECT :::安卓实况:
   7(内测工程机a511) Android 5.1 2.9.10 1792834104797
  session_start_gmt = 1468294723126&安培; session_start_page =移动:MSDK:PRECONNECT :::机器人:活:安培; session_end_gmt = 1468294723126&安培; session_end_page =移动:MSDK:PRECONNECT :::机器人:活:安培; mobile_carrier = CMCC&安培;的 mobile_app =优步    - 优步| 2.9.10 | com.ubercab& event_type = im& data_source = msdk& session_length_sec = 0.0& user_logged_in = out& eap_source_path = hdfs:// stampy / sys / dt / fpti / v1 / seq / enrichedevents / 2016 / 7月11日/ 20 /部分-M-00017&安培; user_guid = 5c7ee584-6f83-4062-b324-2219922e8848&安培; CLIENT_ID = AdtlNBDhgmQWi2xk6edqJVKklPFyDWxtyKuXuyVT-OgdnnKpAVsbKHgvqHHP&安培; GEO_CITY =温州&安培; session_id_cookie_id = 10692349-5c7ee584-6f83-4062-b324-2219922e8848&安培; bot_flag_yn = N'安培; epoch_timestamp_client = 1468294684561&安培; server_businessname =移动&安培; unique_message_id = dd2f76361550a4a9c5b18125feef485d&安培; page_version =机器人:活:安培; DEVICE_ID = 5c7ee584-6f83-4062-b324-2219922e8848&安培; business_channel_name = MSDK&安培; user_session_guid = 10692349&安培; rosetta_language = zh_CN的&安培; geo_state =浙江&安培; IP_ADDRESS = 36.251.188.153&安培; GEO_REGION =浙江&安培; gmt_offset_minutes = 480&安培; EVENT_TIME = 1468294723126&安培; geo_cntry = CN&安培; GEO_COUNTRY = CN&安培; Web_Session_bot_flag_yn = N&安培; Web_Session_session_mobile_app_version = 2.9.10&安培;网络_Session_session_mobile_device = 7 (内测工程机A511)的&安培;我们 b_Session_session_mobile_os = Android的   5.1 1807172862537 session_start_gmt = 1468715369910& session_start_page = mobile:consapp:apptransition :: didfinishlaunching:android ::& session_end_gmt = 1468716146342& session_end_page = mobile:consapp:apptransition :: willenterbackground:android ::& mobile_carrier = Sprint& client_os = Android& is_bot =假安培; DEVICE_TYPE =移动   电话及安培; EVENT_TYPE = IM&安培; session_length_sec = 0.0&安培; prospect_type = 0&安培; DEVICE_NAME =冲刺   Galaxy S6   (SM-G920P)及eap_source_path = HDFS:// stampy / SYS / DT / fpti / V1 / SEQ / enrichedevents / 2016/07/16/17 /部分-M-00026&安培; user_guid = 7397ebff63b98126&安培; GEO_CITY =塞科克&安培; geo_zip = 02771&安培; device_timestamp = 1468715590595&安培; session_id_cookie_id = 59906919-7397ebff63b98126&安培; encrypted_customer_id = J4AN28RAWDY46&安培; bot_flag_yn = N&安培; epoch_timestamp_client = 1468715590595&安培; mobile_device_id = 7397ebff63b98126&安培; unique_message_id = f645670c1550a4a9e9f67ceefc230a85&安培; user_session_guid = 59906919&安培;的 SEARCH_KEYWORD =   &安培; rosetta_language = EN_US&安培; geo_state = NULL&安培; IP_ADDRESS = 66.87.124.151&安培; browser_type = Android的   浏览器与GEO_REGION = MA&安培; gmt_offset_minutes = -240&安培; EVENT_TIME = 1468715591436&安培; geo_cntry = US&安培; GEO_COUNTRY = US&安培; Web_Session_bot_flag_yn = N&安培; Web_Session_session_mobile_app_version = 6.4.2&安培; Web_Session_session_mobile_device = SM-G920P&安培; Web_Session_session_mobile_os = Android的   6.0.1

我试着像下面那样替换,但没有运气。任何人都可以请帮助

LANG=C sed 's/[\x80-\xFF]//g'  filename

sed 's/^M//g' filename

sed 's/'"$(printf '\015')"'//g' filename 

提前致谢!

0 个答案:

没有答案