November 30, 2010

Resources for using microformats and structured data

microformats

Deltina HayWe set out in this series discussing how the Semantic Web relies upon markup languages that tag Web content so it is easier for machines to interpret. This can be accomplished in a number of ways including tagging content as structured data or linked data. The last article in this series provided an introduction to marking up your content as structured data using microformats.

Microformats are one of the standard markup formats used to create structured data. Like any markup language, they consist of tags and attributes that are used to “mark up” your Web content so that a search engine can recognize the content as structured data.

I was originally going to continue this series with an article about creating structured data using RDFa, but realized that there are so many great resources out there on microformats that I would hate to leave the topic without mentioning them.

Following is a list of tools and other resources that can help you mark up your content as structured data to prepare it for the Semantic Web.

Microformats templates

In the last article on using microformats to create structured data, I mentioned some tools that can help you generate your own structured content using microformats. Here are links to those and some additional templates:

November 9, 2010

The benefits of structuring your data using microformats

Google Rich Snippet Testing Tool
Google’s Rich Snippet Testing Tool

Deltina HayLast week we discussed how the Semantic Web relies upon markup languages that tag Web content, making it easier for machines to interpret. This can be accomplished in a number of ways, including tagging content as structured data or linked data.

Today we’ll take a look at marking up your content as structured data using microformats.

Microformats for structured data

Microformats are one of the standard markup formats used to create structured data. Like any markup language, they consist of tags and attributes that are used to “mark up” your Web content so that a search engine can recognize the content as structured data.

Content that is typically marked up using this standard includes contact and location information, reviews, products, and events. To transform your data into structured data using microformats, you simply add some additional classes and tags to your existing HTML, adhering to the microformats standard.

To demonstrate, let’s look at the “hCard” format. This format is used for marking up information about people, companies, organizations, and places. Here is how the marked-up content will look within the HTML of your Web page:

————-

<div id=”hcard-Deltina-Hay” class=”vcard”>
<a class=”url fn” href=”http://www.plumbwebsolutions.com”>Deltina Hay</a>
<div class=”org”>PLUMB Web Solutions</div>

<a class=”email” href=”mailto:[email protected]”>[email protected]</a>
<div class=”adr”>
<div class=”street-address”>P.O. Box 242</div>
<span class=”locality”>Austin</span>

<span class=”region”>Texas</span>
<span class=”postal-code”>78767</span>
<span class=”country-name”>USA</span>
</div>

<div class=”tel”>512-555-9999</div>
</div>

————-

And this is how it will appear on your website:

Deltina Hay
PLUMB Web Solutions
[email protected]
P.O. Box 242
Austin, Texas, 78767 USA
512-555-9999

To the naked eye, there is nothing special about this content. It is nothing more than your contact information with links. Search engines and Internet browsers, however, will now be able to interpret the content as structured data — specifically structured contact and location information about you and your company — and display it or use it accordingly. All you need to do is mark up your existing contact information using the microformats standards.

Microformats.org has a lot of resources to help you out, including an hCard creator that you can use to generate code similar to that in our example. Continue reading

November 1, 2010

The Semantic Web: An explanation in plain English

E&O search results
An example of Google Rich Snippets.

Deltina HayThe Semantic Web is a big step toward Web 3.0, where the ultimate goal is to make Web content more machine-friendly and thus, in turn, more useful to humans.

Most websites are produced using HTML, which is a markup language used to make a website “look” a certain way. The Semantic Web, on the other hand, is based on markup languages that focus on tagging the content by what it “means.”

A more “semantic” Internet will allow search engines to produce more relevant results because the searched content will be “marked up” in such a way that the engines (machines) can make better sense of it.

The Semantic Web is not AI (artificial intelligence), as some people seem to think. It is about making the content easier for machines to interpret, not about making the machines themselves smarter. Two ways in which this is accomplished is through structured data and linked data.

Structured data: Making it easier to share information

You can prepare your content in a way that will help search engines include it in very relevant search results. For instance, you can offer ways for your contact information, products or reviews to show up directly in a Google or Yahoo search result by adding a few tags to your content that will transform it into what is called “structured data.”

Contact and location information, events, products and reviews are all perfect types of structured data and can be tagged in standard formats called “markup formats” to make it easy for search engines to recognize them as such.

Structured data has been around for some time, waiting in the wings for the search engines to take it seriously. In 2009, Google introduced “Rich Snippets,” a feature that recognizes markup formats and displays the content in your search listing accordingly. See the image at top for an example.

Google is supporting the two most standard markup formats: Microformats and RDFa. Both of these standard formats are very straightforward. Anyone with experience building a website or using a content management system like WordPress can easily use them to mark up their existing Web content as structured data.

Linked data: Create apps from rich datasets

Linked Data also refers to a way of structuring data, but it does so by using the Web to create links between data from many different datasets and classifies it using an established data commons.

By using a common reference to represent a piece of data, that data can be linked easily to and from other sources of data, creating what is referred to as a “Web of Data.”

The most impressive of these Webs of Data is the Linked Open Data (LOD) cloud. In the center of this “cloud” (only a small part of it) is “Dbpedia,” which is the dataset that feeds Wikipedia.

Linked Open Data Cloud

The resulting “Web of Data” can be accessed by semantic Web browsers that navigate between different data sources, similar to how traditional Web browsers navigate between HTML pages.

One of the things that make Linked Data so powerful is what one can do with the data once it is linked. Given the right tools and know-how, anyone can draw from this tremendous resource to create powerful applications. Continue reading

May 5, 2010

The untapped power of APIs


The power of APIs from JD Lasica on Vimeo.

How you can let outsiders mine your data for gold

JD LasicaIn tech circles, the importance of APIs are a given. Flickr, Twitter, Google Maps and Facebook all became dominant in their sectors partly through the genius of releasing open APIs to outside developers.

I’ll let our sister site, Socialbrite, handle the definition of APIs, but here are some examples to put it in concrete terms:

See the Flickr widget there in the sidebar to the right? It contains an API that allows this WordPress blog to display the images from Flickr in a certain way.

See the Twitter conversations widget there on the right? It contains an API that allows this blog to pull and display Twitter tweets containing certain keywords.

See the Facebook and Twitter logos at the bottom of comments on this story page? It contains an API that lets people log in via those social sites to leave a comment on this blog. Same for the Social media jobs widget and Upcoming calendar widget in Socialmedia.biz’s sidebar.

Over the years, I’ve come to understand that almost nobody in the media world (outside of the techies), nobody in the nonprofit world and few in the business world understand the importance of this — and the untapped power that lies in creating your own APIs. In my talk at NewComm Forum 11 days ago I included a discussion of how news publications should get their hands on public data related to public schools, public hospitals, Census figures and the like — and release the raw data, not just the resulting stories, to the public.

A well-structured API can let outsiders:

• create a tool or widget that slices and dices public data in surprising new ways, bringing additional meaning and value to information that your own staff may not be aware of;

• create an app that displays specific sets of your content in interesting new ways — say, optimized for a mobile device;

• bring greater context to your data or content, comparing it to data provided by others in your sector;

• do lots more. But let Greg Elin tell it.

I’ve bumped into Greg often over the years during his tenure at the Sunlight Foundation. Now he’s chief technology officer of United Celebral Palsy.

APIs: When simple is better

At a recent retreat in Marin County, Greg and I started chatting about APIs when I whipped out my Kodak zi8 recorder (on loan from Kodak) and captured this high-quality video of him explaining the value of APIs and how organizations and businesses can put them to good use.

“You’re turning your assets into little tiny Legos that other people can use to build new things.”
– Greg Elin, UCP.org

Greg gave one of simplest definitions of an API I’ve heard: An API is a “remote control for a piece of software or a computer. … I can write software that has remote control over another piece of software that has an API.”

An API can supply answers to questions like: Where’s a good restaurant in town? What’s the latest stock market quotes? By providing an API, Greg says, “you allow other people to add value to your content and your information. … You’re turning your assets into little tiny Legos that other people can use to build new things.”

Watch, download or embed the video on Vimeo Continue reading