Internationalization of URLs

Author: Martin J. Dürst

(mduerst@ifi.unizh.ch) This represents my view in 1996, it is outdated. For an update, click here. This document summarizes my impressions and insights from earlier discussions on the net as well as some proposals for progress.

Overview

URLs are identifiers for resources on the internet. Due to the way their syntax is currently defined, they are in some cases very obviously self-explaining (hypothetical example: mailto:john.smith@ibm.com), whereas in other cases, this is completely different (immagine a similar address in Japan, that has to be written in Latin letters to conform to URL syntax, but that is not as understandable to a reader that is used mainly to Japanese characters). There is therefore a clear and certain need for the internationalization of URLs. However, this need up to now has only met several obstacles, and has not yet found an appropriate solution.

URLs as Seen by their Inventors

To understand the full magnitude of the problem of internationalizing URLs, it is important to understand what the creators of URLs and related items (URNs, URIs,...) had intended them for.

URLs in Practice

To the above abstract view contrasts the practical daily use of URLs:

Urgency of Internationalization of Different Schemes and Parts

For different schemes/protocols and for different parts of these schemes, e.g. representing domain names, path names, file names, or arbitrary data, the urgency of internationalization differs widely. Roughly, the following categories can be distinguished:

Some Possible Solutions

The need for serious improvement of the situation is well recognized among the comunity concerned with internationalization. It seems hovewer to be less of a concern for those communities controlling URLs in general and for those controlling the respective protocols. The decisive factor for a serious internationalization of URLs therefore migth be more organizatorial/political than technical. On the technical side, solutions that don't break existing implementations (difficult to check) and are otherwise unobtrusive may be preferred. On the organizatorial level, I see the importance that the I18N community agrees on

Without such a consensus, I see it very difficult to get others to accept our proposals. Before becomming too pessimistic, here a few actual proposals, some of which can be combined together:

Other Proposals/Discussions

Conclusions

There is great need for an internationalization of URLs, but there are still many problems on a conceptional level, on a technical level, and on an organizatorial/political level that have to be attacked. I hope that we can make as much progress as possible at the upcomming workshop.


Last updated May 4th, 1996, mduerst@ifi.unizh.ch