[Sputnik-list] Sputnik 8.07.07

Petite Abeille petite.abeille at gmail.com
Mon Jul 7 15:46:40 GMT+2 2008


On Jul 7, 2008, at 11:14 AM, Yuri Takhteyev wrote:

> 5. There are many other new features, see
> http://sputnik.freewisdom.org/en/Features

Cool. One thing though, URLs containing non-ASCII characters must be  
encoded appropriately...

For example, under the 'Features' URL:

http://sputnik.freewisdom.org/en/Features

<li>Content and page names in <strong>any script</strong> with UTF8.  
(see <a href="/en/Спутник" class="local">Спутник</a> and  
<a href="/en/史潑尼克" class="local">史潑尼克</a>).(Diff works  
better for languages that use spaces, e.g. diff for Chinese is on line- 
by-line basis.  No support for bi-directional text at the moment.)   
User names can be in any script too.</li>

href="/en/Спутник"
href="/en/史潑尼克"

While "Спутник" and "史潑尼克" are fine as titles, they do  
not appear to be properly encoded URLs.

In other words, if I'm not mistaken, one cannot simply use raw UTF-8  
byte sequences, one need to encode them first, e.g.:

/en/Спутник  -> /en/%d0%a1%d0%bf%d1%83%d1%82%d0%bd%d0%b8%d0%ba
/en/史潑尼克  -> /en/%e5%8f%b2%e6%bd%91%e5%b0%bc%e5%85%8b

Cheers,

--
PA.
http://alt.textdrive.com/nanoki/








More information about the Sputnik-list mailing list