Publishing data as Linked Data on the Web of Data enables the integration of different data sources. It allows for displaying and querying different data sources. Furthermore it allows the integrating data describing the same entity.
When publishing Linked Data on the Web, data is represented using the Resource Description Framework (RDF).
The Web of Linked Data is built upon two simple ideas: Structured data is published on the Web using dereferencable HTTP URIs to represent data items wherein related data items are connected using RDF links.
It is desirable to not only publish the data but to also publish its schema as Linked Data. Thus, Linked Data applications can e.g. customize views on the data.
The format of the original data which is to be published as Linked Data is relevant for the publications steps to take.
If the original data is available in a relational database, we recommend using D2R Server to publish the data along with its schema as Linked Data. D2R Server offers a HTML, Linked Data and SPARQL interface to the published data. It is written in Java and licensed under the Apache License V2.0.
If the original data is available in any structured format, it has to be converted to RDF by a converter.
For CSV and Excel files we recommend using the RDF Extension for Google Refine. It adds a graphical user interface for exporting data of Google Refine projects in RDF format. The extension is available under the BSD license.
For XML files we recommend using XSLT to transform them into RDF.
Any other formats have to be converted using your own converter. You can also check the converter list at the W3C Semantic Web wiki.
If you have converted your data into RDF, you have two options to publish it on the Web as Linked Data.
You can serve the RDF file(s) using any web server. You will have to enable URL rewriting to make the Linked Data URIs in your data set dereferencable. We can recommend this method for publishing small data sets or vocabularies.
For bigger data sets we recommend setting up a Triple store. Triple stores allow for storing Linked Data and providing it using different interfaces. Via a SPARQL interface the data is made accessible for querying. Ideally Triple stores provide a Linked Data interface. In addition, a HTML interface can be offered.
A list of Triple stores is available at the W3C Semantic Web wiki. The Berlin SPARQL Benchmark (BSBM) is a benchmark for comparing the performance of Triple stores that expose SPARQL endpoints. The benchmark results on the store performances are available online.
As a Linked Data and HTML interface for any Triple store not offering those we recommend Pubby. Pubby makes it easy to turn a SPARQL endpoint into a Linked Data server. It is implemented as a Java web application and available under the Apache License V2.0.