[转] Request 接收参数乱码原理解析

首页 > 代码库 > [转] Request 接收参数乱码原理解析

[转] Request 接收参数乱码原理解析

2024-08-01 22:42:25 216人阅读

起因：

今天早上被同事问了一个问题：说接收到的参数是乱码，让我帮着解决一下。

实际情景：

同事负责的平台是Ext.js框架搭建的，web.config配置文件里配置了全局为“GB2312”编码:

<globalization requestEncoding="gb2312" responseEncoding="gb2312" fileEncoding="gb2312" culture="zh-CN"/>

当前台提交“中文文字”时，后台用Request.QueryString["xxx"]接收到的是乱码。

无论用System.Web.HttpUtility.UrlDecode("xxx","编码类型")怎么解码都无效。

原理说明：

1：首先确定的是：客户端的url参数在提交时，Ext.js 会对其编码再提交，而客户端的编码默认是 utf-8 编码

客户端默认有三种编码函数：escape() encodeURI() encodeURIComponent()

2：那为什么用 Request.QueryString["xxx"] 接收参数时，收到的会是乱码？

为此，我们必须解开 Request.QueryString 的原始处理逻辑过程

我们步步反编绎，

2.1：看 QueryString 属性的代码：

 1 public NameValueCollection QueryString 2 { 3     get 4     { 5         if (this._queryString == null) 6         { 7             this._queryString = new HttpValueCollection(); 8             if (this._wr != null) 9             {10                 this.FillInQueryStringCollection();//重点代码切入点11             }12             this._queryString.MakeReadOnly();13         }14         if (this._flags[1])15         {16             this._flags.Clear(1);17             ValidateNameValueCollection(this._queryString, "Request.QueryString");18         }19         return this._queryString;20     }21 }

2.2：切入 FillInQueryStringCollection()方法

 1 private void FillInQueryStringCollection() 2 { 3     byte[] queryStringBytes = this.QueryStringBytes; 4     if (queryStringBytes != null) 5     { 6         if (queryStringBytes.Length != 0) 7         { 8             this._queryString.FillFromEncodedBytes(queryStringBytes, this.QueryStringEncoding); 9         }10     }//上面是对流字节的处理，即文件上传之类的。11     else if (!string.IsNullOrEmpty(this.QueryStringText))12     {13         //下面这句是对普通文件提交的处理:FillFromString是个切入点,编码切入点是：this.QueryStringEncoding14         this._queryString.FillFromString(this.QueryStringText, true, this.QueryStringEncoding);15         16     }17 }

2.3：切入：QueryStringEncoding

 1 internal Encoding QueryStringEncoding 2 { 3     get 4     { 5         Encoding contentEncoding = this.ContentEncoding; 6         if (!contentEncoding.Equals(Encoding.Unicode)) 7         { 8             return contentEncoding; 9         }10         return Encoding.UTF8;11     }12 }13 //点击进入this.ContentEncoding则为：14 public Encoding ContentEncoding15 {16     get17     {18         if (!this._flags[0x20] || (this._encoding == null))19         {20             this._encoding = this.GetEncodingFromHeaders();21             if (this._encoding == null)22             {23                 GlobalizationSection globalization = RuntimeConfig.GetLKGConfig(this._context).Globalization;24                 this._encoding = globalization.RequestEncoding;25             }26             this._flags.Set(0x20);27         }28         return this._encoding;29     }30     set31     {32         this._encoding = value;33         this._flags.Set(0x20);34     }35 }

说明：

从QueryStringEncoding代码得出，系统默认会先取globalization配置节点的编码方式，如果取不到，则默认为UTF-8编码方式

2.4：切入 FillFromString(string s, bool urlencoded, Encoding encoding)

 1 internal void FillFromString(string s, bool urlencoded, Encoding encoding) 2 { 3     int num = (s != null) ? s.Length : 0; 4     for (int i = 0; i < num; i++) 5     { 6         int startIndex = i; 7         int num4 = -1; 8         while (i < num) 9         {10             char ch = s[i];11             if (ch == ‘=‘)12             {13                 if (num4 < 0)14                 {15                     num4 = i;16                 }17             }18             else if (ch == ‘&‘)19             {20                 break;21             }22             i++;23         }24         string str = null;25         string str2 = null;26         if (num4 >= 0)27         {28             str = s.Substring(startIndex, num4 - startIndex);29             str2 = s.Substring(num4 + 1, (i - num4) - 1);30         }31         else32         {33             str2 = s.Substring(startIndex, i - startIndex);34         }35         if (urlencoded)//外面的传值默认是true,所以会执行以下语句36         {37             base.Add(HttpUtility.UrlDecode(str, encoding), HttpUtility.UrlDecode(str2, encoding));38         }39         else40         {41             base.Add(str, str2);42         }43         if ((i == (num - 1)) && (s[i] == ‘&‘))44         {45             base.Add(null, string.Empty);46         }47     }48 }

说明：

从这点我们发现：所有的参数输入，都调用了一次：HttpUtility.UrlDecode(str2, encoding);

3：结论出来了

当客户端js对中文以utf-8编码提交到服务端时，用Request.QueryString接收时，会先以globalization配置的gb2312去解码一次，于是，产生了乱码。

所有的起因为：

1：js编码方式为urt-8

2：服务端又配置了默认为gb2312

3：Request.QueryString默认又会调用HttpUtility.UrlDecode用系统配置编码去解码接收参数。

文章补充：

1：系统取默认编码的顺序为：http请求头->globalization配置节点-》默认UTF-82：在Url直接输入中文时，不同浏览器处理方式可能不同如：ie不进行编码直接提交，firefox对url进行gb2312编码后提交。3：对于未编码“中文字符”，使用Request.QueryString时内部调用HttpUtility.UrlDecode后，由gb2312->utf-8时，如果查不到该中文字符，默认转成"%ufffd"，因此出现不可逆乱码。

4：解决之路

知道了原理，解决的方式也有多种多样了：

a：全局统一为 UTF-8 编码，省事又省心。

b：全局指定了 GB2312 编码时，url 带中文，js 非编码不可，如 ext.js 框架。

这种方式你只能特殊处理，在服务端指定编码解码，
因为默认系统调用了一次HttpUtility.UrlDecode（"xxx",系统配置的编码)，
因此你再调用一次HttpUtility.UrlEncode（"xxx",系统配置的编码)，返回到原始urt-8编码参数
再用HttpUtility.UrlDecode（"xxx",utf-8)，解码即可。

5：其它说明：默认对进行一次解码的还包括 URI 属性,而 Request.RawUrl 则为原始参数

[转] Request 接收参数乱码原理解析

声明：以上内容来自用户投稿及互联网公开渠道收集整理发布，本网站不拥有所有权，未作人工编辑处理，也不承担相关法律责任，若内容有误或涉及侵权可进行投诉：投诉/举报工作人员会在5个工作日内联系你，一经查实，本站将立刻删除涉嫌侵权内容。

联系
我们

首页 > 代码库 > [转] Request 接收参数乱码原理解析

[转] Request 接收参数乱码原理解析

看完仍有疑问？有类似问题直接问程序猿