On the cross-fertilization of geospatial and semantic web technology

Metalink unifies Internet downloads

Downloading files is one of the most frequently performed task on the Internet. Today there are many protocols for downloading files — e.g., HTTP, FTP and bittorent. Different sites (e.g., Linux distributions, OpenOffice) offer downloads in some or all those protocols. Metalink is a new XML-based markup language and technology for unifying file downloads across multiple protocols. Digging deeper into the Metalink format representation, I find some surprises — the use of tagging and expressive content description.

Problem

There are at least two problems existing in the current file downloading scheme.

One, it’s difficult for the end users to discover the most speedy way to download a file. Sites often list dozens of mirror servers for file downloads. Depending on the workload of each server, the download time can vary greatly. For the users, it’s not always easy to find out which mirror server can offer the shortest download time.

Second, there is no unified format representation for descrbing file downloads. For protocols such as FTP and HTTP, it’s not usually easy to say things like “from this URL, you can download a copy of OpenOffice version 2.1.0 for the Windows platform.” Although some P2P protocol such bittorent allows the explicit description of file downloads, but such description is not usually understood by the non-P2P clients (e.g., Web Browsers).

What’s Metalink?

In the Metalink 3.0 Specification, Metalink is defined as the follows.

Metalinks bundle the various ways (FTP/HTTP/P2P) to acquire files into one format for easy content distribution/Electronic Software Distribution (ESD). This can increase download performance, reliability, usability, & efficiency. Any program that downloads files could potentially use it. A Metalink is a Multi Method Metalinker (MMM or M3talinker) document. MMM is a dialect of XML and MMM files must conform to the XML 1.0 specification. “MMM” might also look like “WWW” mirrored.

Solutions

Metalink solves the first problem — how to find the most speedy way to download a file — by grouping different download protocols into one protocol. This enables Metalink clients to automatically switch between different mirror servers without explicit user instructions.

The following is a code snippet of a Metalink description for the OpenOffice downloads:

metalink code snippet — openoffice

In the above example, not only multiple server information is described but also meta-data (e.g., the location of the server, document tags and license). Although this information is not in RDF, but it’s clear that it provides expressive description about file downloads. I imagine this information can be consumed by Metalink clients to build up smart search capabilities and maybe recommend files to users.

Remarks

I’m quite excited about Metalink. While there is no guarantee that Metalink will revolutionize Internet file downloads, but its use of markup languages to describe information on the Web and its attempt to shield users from heterogeneous download protocols are signs of the emerging Semantic Web.

Resources:

Sharing is Good. These icons link to social bookmarking sites where readers can share and discover new web pages.
  • del.icio.us
  • YahooMyWeb
  • Reddit
  • co.mments
  • Furl
  • Ma.gnolia
  • NewsVine
  • Simpy
  • bodytext
  • E-mail this story to a friend!
  • Facebook
  • Google
  • StumbleUpon
  • Technorati
  • TwitThis

2 Comments

  1. Unsurprisingly, you can filter downloads by location. Metalinks can also contain/list multiple files, so you can list the OpenOffice.org binaries for all platforms, all languages, and all locations (mirrors) in one .metalink, then filter by platform - one link for all downloads (or something). Unfortunately…the different Metalink download clients don’t all support this. :(

    So maybe someday soon, someone in Brazil that speaks French and uses Linux can click on the link & get the French Linux version from the local Brazilian mirrors. Someone in Japan that speaks English on Mac OS X can click that same link & get the English OS X version from the local Japanese mirrors.

    Not revolutionary, but could end up being cool.

    Comment by Ant Bryan — February 26, 2007 @ 3:27 am

  2. [...] my previous post, I wrote about Metalink, a new markup language and technology for unifying Internet download. The current Metalink language [...]

    Pingback by Geospatial Semantic Web Blog - GIS Data Integration, Geo Ontology, Geo Tagging & Geo Web 2.0 News » Blog Archive » Metalink meets RDF and SPARQL — March 11, 2007 @ 11:58 am

Leave a comment

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>