Some VIVO Things Blog

Musings on the ecosystem, community, software, data, use, and whatever else comes to mind.

Connections

When I first saw VIVO, I was struck by the simplicity and power of its connective nature. Anything could be connected to anything through the elegant syntax of the semantic web. Over the years, there has been a focus on "profiles" -- assemblies of works of individual scholars, in formats that resemble department web pages, or curriculum vitae, or biosketches. This focus has created a perception that VIVO is a profile system. That's unfortunate.

VIVO can display profiles, and there is great value in using an open source, semantic web application to display profiles. But VIVO is much more than a profile system. VIVO creates an integrated "graph" of the entities it represents. Graphs are the natural structures for representing connections. Triples are the natural means of representing graphs in computers.

What connections are of interest? A simple, but perhaps not very useful answer is "all." We want to know which people used what resource. We want to know which grants funded what datasets. We want to know what events exhibited which presentations.

Suppose we wish to make connections between six kinds of things -- works, people, datasets, grants, organizations, and people. We can, of course, imagine other kinds of things we might wish to connect -- events, resources, locations, time periods, and so on. For six kinds of things, there are 15 kinds of pairs of things -- grants and people, works and datasets, plus the six reflexive pairings -- people to people, works to works, etc.

We can start to think of the kinds of connections there might be between various kinds of things. Consider people and grants. We have a person funded by a grant. A person is the principal investigator of a grant. A person might follow a grant, in the way a person might follow another person in social media. There could be many types of connections between people and grants. Each can be modeled in VIVO.

We can start to think of a thing and the types of connections that thing might have to other things. Consider a publication as a thing. The publication is connected to a journal, to its authors, to a time of publication. The publication is connected to its publisher through its journal, to various organizations through its authors, to a full text representation in a repository, to metrics regarding its use, possibly to grant funding, to software and datasets, to concepts, and to other publications through references and citations. In other words, the publication is a node in a connected graph consisting of other elements of scholarship.

Some connections have their own data -- data about the connection itself. A person has a position in an organization. The position is a connection between the person and the organization indicating employment. The employment has a time period, and a title. The position typically has a type. Each of these pieces of data are about the connection between the person and the organization.

Each connection is recorded as data in VIVO. VIVO has data about the connected things, and the nature of the connection between the things, and data about the connection between the things. Of all of these, the person has a bit of a special place in a VIVO. There has been more effort put into representing people and their connections, and more effort in displaying people and their connections. This is the so-called profile, and leads to the characterization of VIVO as a profile system. But underneath, VIVO is a connected graph, and we can have any amount of detail we wish about any of the elements of the graph.

What if we made profiles of events rather than people? What would an event "profile" look like? What information do we want to see? How far into the graph of connections would we reach? Are there important connections between events? Between events and other things in the connected graph of scholarship? What would we be able to know that is difficult to know now? The same questions can be asked of any of the graph entities.

What are the connections of interest? What connections are already in VIVO and which should be added? What is missing in order to make the connections of interest? If we had the connections we need, how would we use them to better understand and use the scholarship of our organization and of the world? What tools and capabilities would we create?

In future columns, I'll share some thoughts about these questions. Please share your thoughts with others. It is through sharing that we will grow our understanding of the utility of connections, and of what we must build next.