Alan Davoust - Carleton University - Department of Systems and Computer Engineering

Abstract

Many data sharing systems are open to arbitrary users on the Internet, who are independent and self-interested agents. Therefore, in addition to traditional design goals such as technical performance, data sharing systems should be designed to best support the strategic interactions of these agents.

Our research hypothesis is that designs that maximize the participants' autonomy can produce useful data sharing systems.

We apply this design principle to both the system architecture and the functional design of a data sharing system, and study the resulting class of systems, which we call Decentralized Social Data Sharing ((DS)²) systems.

We formally define this class of systems and provide a reference implementation and an example application: a distributed wiki system called P2Pedia. P2Pedia implements a decentralized collaboration model, where the users are not required to reach a consensus, and instead benefit from being exposed to multiple viewpoints. We demonstrate the value of this collaboration model through an extensive user study.

Allowing the users to autonomously control their data prevents the system architecture from being optimized for efficient query processing. We show that Regular Path Queries, a useful class of graph queries, can still be processed on the shared data: although in the worst case such queries are intractable, we propose a cost estimation technique to identify tractable queries from partial knowledge of the data.

Through simulation, we also show that the users' control over network connections allows them to self-organize and interact with other users with whom their interests are best aligned. This may result in less data being available, and we study cases where this is in fact demonstrably beneficial to the users, as the available data to each user is the most relevant to them.

This suggests that querying this reduced collection of shared data may lead to more tractable query processing without necessarily reducing the users' utility.

	Teaching: (winter 2016) SYSC3020, Introduction to Software Engineering lecture notes on github (WIP) SYSC4806, Software Engineering Lab
	Publications at DBLP
	Ph.D. Thesis: Decentralized Social Data Sharing Show Abstract Abstract Many data sharing systems are open to arbitrary users on the Internet, who are independent and self-interested agents. Therefore, in addition to traditional design goals such as technical performance, data sharing systems should be designed to best support the strategic interactions of these agents. Our research hypothesis is that designs that maximize the participants' autonomy can produce useful data sharing systems. We apply this design principle to both the system architecture and the functional design of a data sharing system, and study the resulting class of systems, which we call Decentralized Social Data Sharing ((DS)²) systems. We formally define this class of systems and provide a reference implementation and an example application: a distributed wiki system called P2Pedia. P2Pedia implements a decentralized collaboration model, where the users are not required to reach a consensus, and instead benefit from being exposed to multiple viewpoints. We demonstrate the value of this collaboration model through an extensive user study. Allowing the users to autonomously control their data prevents the system architecture from being optimized for efficient query processing. We show that Regular Path Queries, a useful class of graph queries, can still be processed on the shared data: although in the worst case such queries are intractable, we propose a cost estimation technique to identify tractable queries from partial knowledge of the data. Through simulation, we also show that the users' control over network connections allows them to self-organize and interact with other users with whom their interests are best aligned. This may result in less data being available, and we study cases where this is in fact demonstrably beneficial to the users, as the available data to each user is the most relevant to them. This suggests that querying this reduced collection of shared data may lead to more tractable query processing without necessarily reducing the users' utility. Full text in pdf/a (259p, 4.2Mb) Compact version (11pt font, single spaced, 165p, 2.4Mb)
	M.A.Sc Thesis: Collaborative Knowledge Construction in a Peer-to-Peer File-Sharing Network (Awarded Carleton University Senate Medal, 2009) Show Abstract Abstract The Semantic Web is a vision of a web of machine processable data, encoding knowledge, that agents can reason about to process complex tasks. This vision relies on large amounts of data describing interrelated resources. One approach to producing such data is the ``Web 2.0" approach, of collaborative production of content by end users. We seek an alternative to the centralized control model of web-based solutions. Our approach uses the principles of peer-to-peer file sharing, where data is also contributed by end users, with decentralized control, and a data circulation model where files are replicated by downloads, causing popular or useful files to spread in the network. Building on earlier work where this paradigm was applied to structured documents, we extend this approach to support a graph data model. We introduce a formal model for schema-based file sharing systems, then extend this model with links between shared documents. We propose a query language for this graph of interlinked documents. Finally, we describe the design of our prototype implementation. Full text in pdf

Please note that I am now at UQO and you should be automatically redirected. Otherwise click here: My new page at UQO.