为什么FILTER_VALIDATE_URL不适用于波斯字符?

为什么FILTER_VALIDATE_URL不适用于波斯字符?

问题描述:

Here is my code:

$url = "http://localhost/myweb/qanda/1/";
var_dump(filter_var($url, FILTER_VALIDATE_URL));
//=> string(31) "http://localhost/myweb/qanda/1/"

It is known as a valid URL and all fine. But my code doesn't work when there is a Persian character in the URL:

$url = "http://localhost/myweb/qanda/1/س";
var_dump(filter_var($url, FILTER_VALIDATE_URL));
//=> bool(false)

Why? And how can I fix it?

以下是 我的代码: p>

  $ url =“http:// localhost / myweb / qanda / 1 /”; 
var_dump(filter_var($ url,FILTER_VALIDATE_URL))  ; 
 // =>  string(31)“http:// localhost / myweb / qanda / 1 /”
  code>  pre> 
 
 

它被称为有效的URL,一切正常。 但是,当网址中有波斯语字符时,我的代码无效: p> \ n

  $ url =“http:// localhost / myweb / qanda / 1 /س”; 
var_dump(filter_var($ url,FILTER_VALIDATE_URL)); 
 // => 布尔(假)
 代码>  PRE> 
 
 

为什么呢? 我该如何解决? p> div>

Well, basically because the string you've entered isn't a valid URL according to the RFC:

"Only alphanumerics [0-9a-zA-Z], the special characters "$-_.+!*'()," [not including the quotes - ed], and reserved characters used for their reserved purposes may be used unencoded within a URL"

Your client (the browser) will translate your character to %D8%B3 which is valid and send that to the server, but your client will probably still show س (mine does).

So basically, the correct URL would be validated like so:

$url = "http://localhost/myweb/qanda/1/%D8%B3";
var_dump(filter_var($url, FILTER_VALIDATE_URL));

You can safely use urlencode() to translate foreign characters.