Using Visual Analytics to Detect Problems in Datasets Collected From Photo-Sharing Services

Kachkaev, A. & Wood, J. (2012). Using Visual Analytics to Detect Problems in Datasets Collected From Photo-Sharing Services. Poster presented at the IEEE Conference on Information Visualization (InfoVis), 14 - 19 Oct 2012, Seattle, Washington, US.


      Abstract

      Datasets collected for research often contain millions of records and may carry hidden pitfalls that are hard to detect. This work demonstrates how visual analytics can be used to identify problems in the spatial distribution of crawled photographic data in four datasets: Picasa Web Albums, Panoramio, Flickr and Geograph, chosen as potential data sources for ongoing doctoral research. This poster summary describes a number of problems found in the datasets using visual analytics and suggests that greater attention should be paid to assessing the quality of data gathered from user-generated photographic content. This work is the first part of a three-year PhD project aimed at producing a pedestrian-routing system that can suggest attractive pathways extracted from user-generated photographic content.

      Item Type: Conference or Workshop Item (Poster)
      Uncontrolled Keywords: Visual Analytics, Data Quality Assessment, Flickr, Panoramio, Picasa, Geograph
      Subjects: Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
      Divisions: School of Informatics > giCentre
      URI: http://openaccess.city.ac.uk/id/eprint/1320
