InfoQ

What Social Networks Are Teaching Us About Data Portability

Posted by Steven Robbins on May 22, 2008 07:55 PM

As more social networking sites appear, questions about the data they keep are multiplying. Data portability has become the watchword across the Web 2.0 world. Is there something to be learned about data access and portability from these services?

Several of the major Web 2.0 players and services have announced plans to make the data they store "available" to the users who own it, or to aggregate access to data from other services. MySpace, Yahoo, eBay, Twitter, and Photobucket agreed to a partnership under the MySpace Data Availability initiative. Facebook announced its Facebook Connect technology, which allows members to access their profile data from places other than Facebook. Google launched a preview release of Friend Connect, which will allow users to see and interact across several social networks. FriendFeed released an API that allows programmatic access to its multi-site aggregation capabilities.

In the background, but moving to the forefront, the DataPortability Project has been bringing together partners, technology, principles, and practices to make data portability and ownership a priority and an achievable goal. The organization's mission statement reads:
The DataPortability Project is a group created to promote the idea that individuals have control over their data by determining how they can use it and who can use it. This includes access to data that is under the control of another entity.
DataPortability listed the main points of its philosophy as:
  • You should be able to decide what you do with that data and how it gets used by others
  • Open Source solutions are preferred to closed source proprietary solutions
  • Bottom-up distributed solutions are preferred to top down centralized solutions
Among the main technologies that the Project focused on were OpenID, OAuth, RSS, OPML, microformats, RDF, APML, and XMPP.

While these technologies have been strongly tied to social networking, they have also been gaining adoption in other areas. OAuth has been making inroads with the Google Data APIs and the Yahoo Fire Eagle API. Spring Security (Acegi) added OpenID support. Most of the major browsers have already added or announced microformat support of one kind or another.

The growth in interest and technologies surrounding data availability, portability, and aggregation has ramifications for the design and development of applications outside the social network space. The more often Software-as-a-Service and cloud computing are chosen as enterprise and application models, the more distributed systems become. This distribution can lead to much greater decentralization, even beyond enterprise and organizational boundaries. This can be seen in healthcare with the rise of the Personal Health Record (PHR). With names like Google and Microsoft announcing PHR offerings over the web, data portability and availability will start hitting home with many more people than just those on social networking sites.

Mark Scrimshire summed up the data portability needs for PHRs:
The challenge that the Data portability movement is attempting to address is closely paralleled by the evolution of the Personal Health Record in the healthcare industry. We will want to own our own health information, but we will need to be able to share that information with medical providers and others. We will need a universal ability to share information, but share it securely. At the same time the process of managing access will need to be easy. We can learn an awful lot from the simple approaches that characterize OpenID, OAuth, microformats and other pervasive technologies that have succeeded on the Web.
