编码学习之URL 编码(百分号编码)

你肯定发现了,我们在浏览器中的请求,中文都会变成一个“奇怪的”的百分号字符

https://www.baidu.com/s?wd=%E6%98%A5%E5%A4%A9%E4%BD%A0%E5%A5%BD&rsv_spt=1&rsv_iqid=0x8cce01c500151159&issp=1&f=8&rsv_bp=1&rsv_idx=2&ie=utf-8&tn=baiduhome_pg&rsv_enter=0&rsv_dl=tb&rsv_sug3=14&rsv_sug1=6&rsv_sug7=100&rsv_btype=i&inputT=6888&rsv_sug4=7348w

根据网络标准RFC1738的规定

只有字母和数字[0-9a-zA-Z]、一些特殊符号$-_.+!*以及某些保留字,才可以不经过编码直接用于URL

url编码的规则特别简单,将需要转码的字符转成他们的UTF-8格式(可以替换),然后每一个Byte用两个16进制字符表示,每个Byte前面加一个%号

例如/的utf8是 2F

则url encode之后的就是%2F

我们看下Java和Android中预置的encode方法结果

        val source = "http://www.lukou.com?q=春天你好&referer=Hello"

        logcat(source)
        val encodeString = Uri.encode(source)
        logcat(encodeString)
        logcat(URLEncoder.encode(source,"UTF-8"))
        logcat(URLEncoder.encode(source,"GB2312"))
        logcat(Uri.decode(encodeString))

输出如下

2021-01-28 15:38:35.691 25375-25375/com.plbear.lxc E/imlog: http://www.lukou.com?q=春天你好&referer=Hello
2021-01-28 15:38:35.691 25375-25375/com.plbear.lxc E/imlog: http%3A%2F%2Fwww.lukou.com%3Fq%3D%E6%98%A5%E5%A4%A9%E4%BD%A0%E5%A5%BD%26referer%3DHello
2021-01-28 15:38:35.692 25375-25375/com.plbear.lxc E/imlog: http%3A%2F%2Fwww.lukou.com%3Fq%3D%E6%98%A5%E5%A4%A9%E4%BD%A0%E5%A5%BD%26referer%3DHello
2021-01-28 15:38:35.694 25375-25375/com.plbear.lxc E/imlog: http%3A%2F%2Fwww.lukou.com%3Fq%3D%B4%BA%CC%EC%C4%E3%BA%C3%26referer%3DHello
2021-01-28 15:38:35.694 25375-25375/com.plbear.lxc E/imlog: http://www.lukou.com?q=春天你好&referer=Hello

猜你喜欢

转载自blog.csdn.net/weixin_43662090/article/details/113340864