On the cross-fertilization of geospatial and semantic web technology

MySpace plans to support the Semantic Web

According to the Web User News, MySpace plans to support Semantic Web technologies. From the article, it’s unclear exactly how the world’s largest social networking site plans to support RDF, RDFa or microformats. It may be still too early to celebrate.

At the SocialDevCamp East, we talked about the importance of Semantic Web technologies in context of social media. One question that didn’t get answered is “Who is going to publish semantic descriptions of social media data on the Web?”

Given that leaders like Digg and MySpace have plans to support the Semantic Web, I’m optimistic that the unanswered question will be answered soon.

Previously there was no way of linking this data, but the semantic web is able to retrieve and collate it. This means that it can now input your personal information should you join another social network.

MySpace users will still maintain complete control over what information they share and who gets to see it, but it will make sharing information across different platforms easier and quicker.

Yahoo announced earlier this year its plans to utilise the semantic web with a more efficient tagging system to give better search results.

MySpace’s DeWolfe said he hoped other networking sites, including Facebook, would sign up to the agreement.

Reflecting on the news Digg embraces RDFa

Digg, one of the popular social news web sites, announced that it will begin to support RDFa, a standard for embedding RDF statements in XML documents. Here is a screenshot of digg RDFa in action.

Although it’s unclear at the moment how this new feature will help digg to expand to its market share, but the downstream consequence is definitely positive. Technologies like RDFa and Microformats are crucial to the success of the Semantic Web.

My speculation is that HTML will continue to dominate the market of web publishing. People will continue to publish information in HTML because it’s the best markup language for displaying human-readable information in browsers. It’s the lowest common denominator for cross-platform information display. All desktop computers can run browsers to display HTML. Just about every mobile devices on the market today support some form of HTML rendering. In addition, there are incentives not to introduce other format representations because HTML contents can display well in mobile browsers like the Opera Mobile.

If HTML is here to stay, then from the Semantic Web development point of view, we must figure out how to publish semantic data along side with HTML. In general, there are two approaches: (1) publish the semantic data of each and every HTML pages in separate documents, (2) embed the semantic description in the same HTML pages. RDFa and Microformats are technologies of the latter.

There are pros and cons associated with both approaches. For this reason, I think in the near future we will see web applications to support both approaches. However, if you ask which approach will likely to attract web developers to share data, my answer is the latter approach (i.e., RDFa and Microformats).

First, they would require less overhead in Web development. Adding few extra HTML attributes in the existing template pages is relatively easy. But, creating separate full-blown RDF documents would require completely different set of business logic and template pages.

Second, the use of RDFa and Microformats can utilize the existing techniques for optimizing Web publishing. For example, caching is common technique used by many web sites to improve performance. If semantic data is embedded in HTML, then it can also be cached without much re-implementation.

Third, embedding semantic data in HTML gives web developers a sense of familiarity. People like to work with what they are familiar with, and many of them are reluctant to change. In an early stage of the Semantic Web movement, some web developers may show signs of resistance to RDF document publishing. But, convincing them to use RDFa and Microformats should be easy.

I’m happy to see that RDFa is adopted by Digg, and hope that more news sites will come to follow. I’m thinking that in the next release of gnizr, I will introduce the publishing of semantic data in RDFa or Microformats — some editing of the existing Freemarker template pages should do the trick.