Converting data to JSON format with a regular expression (a small web-scraping utility)

The code is as follows:

import re

infos = '''
Host: fanyi.baidu.com
Connection: keep-alive
Content-Length: 116
Accept: */*
Origin: https://fanyi.baidu.com
X-Requested-With: XMLHttpRequest
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36
Content-Type: application/x-www-form-urlencoded; charset=UTF-8
Referer: https://fanyi.baidu.com/?aldtype=16047
Accept-Encoding: gzip, deflate, br
Accept-Language: zh-CN,zh;q=0.9
Cookie: BAIDUID=2FB7B3626075530B845DE9CC038810D1:FG=1; BIDUPSID=2FB7B3626075530B845DE9CC038810D1; PSTM=1531705450; BDORZ=B490B5EBF6F3CD402E515D22BCDA1598; H_PS_PSSID=1422_21110_27400_26350_22158; REALTIME_TRANS_SWITCH=1; FANYI_WORD_SWITCH=1; HISTORY_SWITCH=1; SOUND_SPD_SWITCH=1; SOUND_PREFER_SWITCH=1; delPer=0; PSINO=5; locale=zh; pgv_pvi=862832640; pgv_si=s2658904064; from_lang_often=%5B%7B%22value%22%3A%22en%22%2C%22text%22%3A%22%u82F1%u8BED%22%7D%2C%7B%22value%22%3A%22zh%22%2C%22text%22%3A%22%u4E2D%u6587%22%7D%5D; to_lang_often=%5B%7B%22value%22%3A%22zh%22%2C%22text%22%3A%22%u4E2D%u6587%22%7D%2C%7B%22value%22%3A%22en%22%2C%22text%22%3A%22%u82F1%u8BED%22%7D%5D; Hm_lvt_64ecd82404c51e03dc91cb9e8c025574=1541037483,1541037833,1541038571,1541044024; Hm_lpvt_64ecd82404c51e03dc91cb9e8c025574=1541044024
'''

# Rewrite each "Name: value" line as 'Name':'value', (the lazy .+? stops at the first colon)
ret = re.sub(r"(.+?):(\s*)(.+)", r"'\1':'\3',", infos)
print('{' + ret + '}')

To use it, just paste the data you want to convert into infos.
Result:

{
'Host':'fanyi.baidu.com',
'Connection':'keep-alive',
'Content-Length':'116',
'Accept':'*/*',
'Origin':'https://fanyi.baidu.com',
'X-Requested-With':'XMLHttpRequest',
'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36',
'Content-Type':'application/x-www-form-urlencoded; charset=UTF-8',
'Referer':'https://fanyi.baidu.com/?aldtype=16047',
'Accept-Encoding':'gzip, deflate, br',
'Accept-Language':'zh-CN,zh;q=0.9',
'Cookie':'BAIDUID=2FB7B3626075530B845DE9CC038810D1:FG=1; BIDUPSID=2FB7B3626075530B845DE9CC038810D1; PSTM=1531705450; BDORZ=B490B5EBF6F3CD402E515D22BCDA1598; H_PS_PSSID=1422_21110_27400_26350_22158; REALTIME_TRANS_SWITCH=1; FANYI_WORD_SWITCH=1; HISTORY_SWITCH=1; SOUND_SPD_SWITCH=1; SOUND_PREFER_SWITCH=1; delPer=0; PSINO=5; locale=zh; pgv_pvi=862832640; pgv_si=s2658904064; from_lang_often=%5B%7B%22value%22%3A%22en%22%2C%22text%22%3A%22%u82F1%u8BED%22%7D%2C%7B%22value%22%3A%22zh%22%2C%22text%22%3A%22%u4E2D%u6587%22%7D%5D; to_lang_often=%5B%7B%22value%22%3A%22zh%22%2C%22text%22%3A%22%u4E2D%u6587%22%7D%2C%7B%22value%22%3A%22en%22%2C%22text%22%3A%22%u82F1%u8BED%22%7D%5D; Hm_lvt_64ecd82404c51e03dc91cb9e8c025574=1541037483,1541037833,1541038571,1541044024; Hm_lpvt_64ecd82404c51e03dc91cb9e8c025574=1541044024',
}
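
Note that the printed result uses single quotes and ends with a trailing comma, so it is a Python dict literal rather than strict JSON. If strict JSON is needed, one possible variation is sketched below. It is not part of the original tool: the helper name headers_to_dict and the shortened sample are assumptions made for illustration. It collects the name/value pairs with re.findall and leaves the quoting to json.dumps.

```python
import json
import re


def headers_to_dict(raw):
    """Hypothetical helper: turn "Name: value" lines into a dict."""
    # With re.MULTILINE, ^ and $ match at every line boundary.
    # The name is everything before the first colon on the line;
    # the value is the rest of the line, so colons in URLs are preserved.
    pairs = re.findall(r"^([^:\n]+):[ \t]*(.*)$", raw, flags=re.MULTILINE)
    return {name.strip(): value.strip() for name, value in pairs}


# Shortened sample taken from the headers above
sample = '''
Host: fanyi.baidu.com
Connection: keep-alive
Origin: https://fanyi.baidu.com
'''

headers = headers_to_dict(sample)
# json.dumps emits double-quoted keys and values with no trailing comma
print(json.dumps(headers, indent=4))
```

The resulting dict can also be used directly in Python code without going through the printed text.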

Reposted from blog.csdn.net/qq_34663267/article/details/83619489