Open Access Article
A Data Quality Strategy to Enable FAIR, Programmatic Access across Large, Diverse Data Collections for High Performance Data Analysis
Informatics 2017, 4(4), 45; doi:10.3390/informatics4040045
Abstract
To ensure seamless, programmatic access to data for High Performance Computing (HPC) and analysis across multiple research domains, it is vital to have a methodology for standardization of both data and services. At the Australian National Computational Infrastructure (NCI) we have developed a Data Quality Strategy (DQS) that currently provides processes for: (1) Consistency of data structures needed for a High Performance Data (HPD) platform; (2) Quality Control (QC) through compliance with recognized community standards; (3) Benchmarking cases of operational performance tests; and (4) Quality Assurance (QA) of data through demonstrated functionality and performance across common platforms, tools and services. By implementing the NCI DQS, we have seen progressive improvement in the quality and usefulness of the datasets across the different subject domains, and demonstrated the ease with which modern programmatic methods can be used to access the data, either in situ or via web services, and for uses ranging from traditional analysis methods through to emerging machine learning techniques. To help increase data re-usability by broader communities, particularly in high performance environments, the DQS is also used to identify the need for any extensions to the relevant international standards for interoperability and/or programmatic access. Full article

Open Access Article
Web-Based Scientific Exploration and Analysis of 3D Scanned Cuneiform Datasets for Collaborative Research
Informatics 2017, 4(4), 44; doi:10.3390/informatics4040044
Abstract
The three-dimensional cuneiform script is one of the oldest known writing systems and a central object of research in Ancient Near Eastern Studies and Hittitology. An important step towards the understanding of the cuneiform script is the provision of opportunities and tools for joint analysis. This paper presents an approach that contributes to this challenge: a collaboration-compatible, web-based scientific exploration and analysis of 3D scanned cuneiform fragments. The WebGL-based concept incorporates methods for compressed web-based content delivery of large 3D datasets and high-quality visualization. To maximize accessibility and to promote acceptance of 3D techniques in the field of Hittitology, the introduced concept is integrated into the Hethitologie-Portal Mainz, an established leading online research resource in the field of Hittitology, which until now exclusively included 2D content. The paper shows that increasing the availability of 3D scanned archaeological data through a web-based interface can provide significant scientific value while at the same time finding a trade-off between copyright-induced restrictions and scientific usability. Full article

Open Access Article
Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles
Informatics 2017, 4(4), 43; doi:10.3390/informatics4040043
Abstract
Despite the fact that Wikipedia is often criticized for its poor quality, it continues to be one of the most popular knowledge bases in the world. Articles in this free encyclopedia on various topics can be created and edited in about 300 different language versions independently. Our research has shown that in language-sensitive topics, the quality of information can be relatively better in the relevant language versions. However, in most cases, it is difficult for Wikipedia readers to determine the language affiliation of the described subject. Additionally, each language edition of Wikipedia can have its own rules for the manual assessment of content quality. There are also differences in grading schemes between language versions: some use a 6–8 grade system to assess articles, and some are limited to 2–3. This makes automatic quality comparison of articles between various languages a challenging task, particularly if we take into account the large number of unassessed articles; some of the Wikipedia language editions have over 99% of articles without a quality grade. The paper presents the results of a relative quality and popularity assessment of over 28 million articles in 44 selected language versions. Comparative analysis of the quality and the popularity of articles in popular topics was also conducted. Additionally, the correlation between quality and popularity of Wikipedia articles of selected topics in various languages was investigated. The proposed method allows us to find articles with information of better quality that can be used to automatically enrich other language editions of Wikipedia. Full article

Open Access Review
Requirements and Pitfalls in AAL Projects. Guide to Self-Criticism for Developers from Experience
Informatics 2017, 4(4), 42; doi:10.3390/informatics4040042
Abstract
Since 2012, several national and international projects on ambient assisted living (AAL) and active and healthy ageing have given insight into the different steps of development processes where the requirements of the target group were not met or simply failed to be realized. This article provides a summary of terms and requirements drawn from practical experience and project work on the topic. It is a reflection and a guide to self-criticism for technical developers. It starts with some terms and how they are linked to each other, giving an overview of the field of action of the projects on which our experience is based. A mixture of literature and project results illustrates the requirements analysis as it was anticipated in recent years. The conclusion consists of recommendations for further research on requirements. Full article

Open Access Article
A Proposed Business Intelligent Framework for Recommender Systems
Informatics 2017, 4(4), 40; doi:10.3390/informatics4040040
Abstract
In this Internet age, recommender systems (RS) have become popular, offering new opportunities and challenges to the business world. With a continuous increase in global competition, e-businesses, information portals, social networks and more, websites are required to become more user-centric and rely on the presence and role of RS in assisting users in better decision making. However, with continuous changes in user interests and consumer behavior patterns that are influenced by easy access to vast information and social factors, raising the quality of recommendations has become a challenge for recommender systems. There is a pressing need to explore hybrid models of the five main types of RS, namely collaborative, demographic, utility-, content- and knowledge-based approaches, along with advancements in Big Data (BD), so that RS become more aware of the context of technological and social changes and behave intelligently. There is a gap in the literature on research in this direction. This paper takes a step to address this by exploring a new paradigm of applying business intelligence (BI) concepts to RS for intelligently responding to user changes and business complexities. A BI-based framework adopting a hybrid methodology for RS is proposed with a focus on enhancing RS performance. Such a business intelligent recommender system (BIRS) can adopt On-line Analytical Processing (OLAP) tools and performance monitoring metrics using data mining techniques of BI to enhance its own learning, user profiling and predictive models for making a more useful set of personalised recommendations to its users. The application of the proposed framework to a B2C e-commerce case example is presented. Full article

Open Access Article
Data Governance in the Sustainable Smart City
Informatics 2017, 4(4), 41; doi:10.3390/informatics4040041
Abstract
The wisdom of ‘smart’ development increasingly shapes urban sustainability in Europe and beyond. Yet, the ‘smart city’ paradigm has been critiqued for favouring technological solutions and business interests over social inclusion and urban innovation. Despite the rhetoric of ‘citizen-centred approaches’ and ‘user-generated data’, the level of stakeholder engagement and public empowerment is still in question. It is unclear how smart city initiatives are developing common visions according to the principles of sustainable urban development. This paper examines how data governance in particular is framed in the new smart city agenda that is focused on sustainability. The challenges and opportunities of data governance in sustainability-driven smart city initiatives are articulated within a conceptual Framework on Sustainable Smart City Data Governance. Drawing on three cases from European countries and a stakeholder survey, the paper shows how governance of data can underpin urban smart and sustainable development solutions. The paper presents insights and lessons from this multi-case study, and discusses risks, challenges, and future research. Full article

Open Access Concept Paper
Big Data in the Era of Health Information Exchanges: Challenges and Opportunities for Public Health
Informatics 2017, 4(4), 39; doi:10.3390/informatics4040039
Abstract
Public health surveillance of communicable diseases depends on timely, complete, accurate, and useful data that are collected across a number of healthcare and public health systems. Health Information Exchanges (HIEs), which support electronic sharing of data and information between health care organizations, are recognized as a source of ‘big data’ in healthcare and have the potential to provide public health with a single stream of data collated across disparate systems and sources. However, given that these data are not collected specifically to meet public health objectives, it is unknown whether a public health agency’s (PHA’s) secondary use of the data supports, or presents additional barriers to, meeting disease reporting and surveillance needs. To explore this issue, we conducted an assessment of the big data available to a PHA (laboratory test results and clinician-generated notifiable condition report data) through its participation in an HIE. Full article

Open Access Article
In Search of Smartness: The EU e-Justice Challenge
Informatics 2017, 4(4), 38; doi:10.3390/informatics4040038
Abstract
At the EU level, an increasing number of resources are being invested in an attempt to provide better public services through the use of Information and Communication Technology (ICT). While new tools are being designed and implemented, a shift from ‘traditional’ technologies that must be used to provide services to more interactive ‘smart’ technologies is beginning to take place. At the same time, an adequate understanding of the implications of this shift is still missing. This paper focuses on the EU e-Justice experience with the ‘API-for-Justice’ project, which investigates the challenges of opening up the European e-Justice Digital Service Infrastructure to external service providers by means of Application Programming Interfaces (APIs). In particular, the exploration of potential services that can be provided by third parties through APIs for Justice shows the potential for a radical redesign of justice service provision, where, for example, justice services are not requested by the party but are proposed or initiated by smart components of the infrastructure on the basis of inputs from the environment. In this perspective, smart technology research and, in particular, Brenner’s (2007) discussion of law and smart technology help to uncover the still unclear dynamics of change that characterize one of the key pillars of modern society: justice. Full article

Open Access Article
Thermal-Signature-Based Sleep Analysis Sensor
Informatics 2017, 4(4), 37; doi:10.3390/informatics4040037
Abstract
This paper addresses the development of a new technique in the sleep analysis domain. Sleep is defined as a periodic physiological state during which vigilance is suspended and reactivity to external stimulation is diminished. We sleep on average between six and nine hours per night, and our sleep is composed of four to six cycles of about 90 min each. Each of these cycles is composed of a succession of several stages of sleep that vary in depth. Analysis of sleep is usually done via polysomnography. This examination consists of recording, among other things, electrical cerebral activity by electroencephalography (EEG), ocular movements by electrooculography (EOG), and chin muscle tone by electromyography (EMG). Recordings are made mostly in a hospital, more specifically in a unit that monitors sleep-related pathologies. The readings are then interpreted manually by an expert to generate a hypnogram, a curve showing the succession of sleep stages during the night in 30 s epochs. The proposed method is based on tracking the thermal signature, which makes it possible to classify the activity into three classes: “awakening,” “calm sleep,” and “restless sleep”. This non-invasive method is intended to contribute to the screening of sleep disorders, with its findings to be validated by a more complete sleep analysis. The measure provided by this new system, based on temperature monitoring (patient and ambient), aims to be integrated into the tele-medicine platform developed within the framework of the Smart-EEG project by the SYEL–SYstèmes ELectroniques team. Analysis of the data collected during the first surveys carried out with this method showed a correlation between thermal signature and activity during sleep. The advantage of this method lies in its simplicity and the possibility of measuring activity during sleep without direct contact with the patient, at home or in hospital. Full article

Open Access Article
Multiple-Criteria Decision Support for a Sustainable Supply Chain: Applications to the Fashion Industry
Informatics 2017, 4(4), 36; doi:10.3390/informatics4040036
Abstract
With increasing globalization and international cooperation, the importance of sustainability management across supply chains has received much attention from companies across various industries. Companies therefore strive to implement effective and integrated sustainable supply chain management initiatives to improve their operational and economic performance while also minimizing unnecessary damage to the environment and maintaining their social reputation and image. The paper presents an easy-to-use decision-support approach based on multiple-criteria decision-making (MCDM) methodologies that aims to help companies develop effective models for timely decision-making involving sustainable supply chain management strategies. The proposed approach can be used by practitioners to ultimately build a comprehensive Analytic Network Process model that will adequately capture and reveal all the interrelationships and interdependencies among the elements in the problem, which is often a very difficult task. To facilitate and simplify this complex process, we propose that hierarchical thinking be used first to structure the essence of the problem, capturing only the major issues, and that an Analytic Hierarchy Process (AHP) model be built. Users can learn from the modeling process and gain much insight into the problem. The AHP can then be extended to an Analytic Network Process (ANP) model so as to capture the relationships and interdependencies among the elements. Our approach can reduce the sustainable expertise, effort and information that are often needed to build an ANP model from scratch. We apply our approach to the evaluation of sustainable supply chain management strategies for the fashion industry. Three main dimensions of sustainability (environmental, economic and social) are considered. Based on the literature, we identified four alternative supply chain management strategies. It was found that the Reverse Logistics alternative appears to be the recommended solution by the AHP model. However, the Socially Leagile Supply Chain is recommended by the ANP model, thereby demonstrating the necessity and importance of considering interdependencies in the model. Full article
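As a rough illustration of the AHP step named in the abstract above, the sketch below derives priority weights from a pairwise comparison matrix. It is not the authors' model: the judgment values are hypothetical, the function name ahp_priorities is invented for this example, and the normalized row geometric mean is a common approximation of the principal-eigenvector weights, used here as an assumption. The ANP extension, which adds interdependencies among elements, is not shown.

```python
import math


def ahp_priorities(matrix):
    """Approximate AHP priority weights via normalized row geometric means.

    matrix[i][j] states how strongly criterion i is preferred over
    criterion j; a reciprocal matrix has matrix[j][i] == 1 / matrix[i][j].
    """
    n = len(matrix)
    geo_means = [math.prod(row) ** (1.0 / n) for row in matrix]
    total = sum(geo_means)
    return [g / total for g in geo_means]


# Hypothetical judgments over the three sustainability dimensions
# (environmental, economic, social); these values are NOT from the paper.
comparisons = [
    [1.0, 2.0, 4.0],
    [0.5, 1.0, 2.0],
    [0.25, 0.5, 1.0],
]

weights = ahp_priorities(comparisons)
for name, weight in zip(["environmental", "economic", "social"], weights):
    print(f"{name}: {weight:.3f}")
```

Because this example matrix is perfectly consistent, the geometric-mean weights coincide with the eigenvector weights; with inconsistent judgments the two can differ slightly.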

Open Access Article
Assessing the Cost Impact of Multiple Transportation Modes to Enhance Sustainability in an Integrated, Two Stage, Automotive Supply Chain
Informatics 2017, 4(4), 34; doi:10.3390/informatics4040034
Abstract
As the automotive industry has been striving to enhance its efficiency, competitiveness, and sustainability, great focus is often placed on opportunities for improving its supply chain operations. We study the effect of introducing multiple modes of transportation in an industry-motivated production and transportation problem involving short-term automotive supply chain planning. We consider multiple, heterogeneous modes of transportation that offer a cost vs. delivery time option to the manufacturer. Having multiple modes of transportation in the system promotes supply chain sustainability. We present an integer linear programming model that captures the availability of multiple transportation modes. We then provide a solution approach based on a hybrid simulated annealing algorithm that we use to analyze the problem. Experimental results demonstrate the impact of additional transportation mode lead times compared to costs in the integrated supply chain. Full article

Open Access Editorial
Advancing Social Media and Mobile Technologies in Healthcare Education
Informatics 2017, 4(4), 35; doi:10.3390/informatics4040035
Abstract
Social media and mobile technologies are important new tools in healthcare education, both to help healthcare professionals learn and maintain their craft, and for the education of patients and families [...]
Full article
Open Access Article
Health Literacy for the General Public: Making a Case for Non-Trivial Visualizations
Informatics 2017, 4(4), 33; doi:10.3390/informatics4040033
Abstract
Health literacy is concerned with the degree to which individuals can access and understand information to make health decisions. The multifaceted nature of health data presents challenges for individuals seeking to improve their understanding of health. To aid health literacy efforts, we have developed HealthConfection, a visualization tool that uses elaborate and non-typical interactive visualizations to represent health data. In this paper, we report on two studies we conducted with HealthConfection. In the first study, we investigate whether individuals can learn to use non-typical visualizations, and the impact that short, minimalist video tutorials will have on participants’ understanding of the visualizations. The findings from this study suggest that individuals can learn to use non-typical visualizations and that participants who used the tutorials achieved higher scores than those without tutorials. This work indicates that non-typical visualizations are a viable option for conveying complex datasets. Based on this foundation, we conducted a second study to investigate if non-typical visualizations can improve health literacy for the general public. Results show that participants who used HealthConfection achieved higher scores than those who did not interact with the tool. Our work suggests that non-typical visualizations can be used to improve health literacy. Full article

Open Access Article
How The Arts Can Help Tangible Interaction Design: A Critical Re-Orientation
Informatics 2017, 4(3), 31; doi:10.3390/informatics4030031
Abstract
There is a long history of creative encounters between tangible interface design and the Arts. However, in comparison with media art, tangible interaction seems to be quite anchored in many of the traditional methodologies imported from human–computer interaction (HCI). How can the Arts help tangible interaction design? Building on Søren Pold’s Interface Aesthetics, a re-orientation of the role of the artist towards a critical examination of our research medium, tangible interaction, is proposed. In this essay, the benefits of incorporating artistic research and its methodologies into our field are described. With these methodologies it is possible to better assess experiential aspects of interaction, a relevant attribute which traditional HCI approaches cannot afford. In order to inform our community, three examples of critical artworks are comparatively studied and discussed. Full article

Open Access Article
Evaluation Tools to Appraise Social Media and Mobile Applications
Informatics 2017, 4(3), 32; doi:10.3390/informatics4030032
Abstract
In a connected care environment, more citizens are engaging in their health care through mobile apps and social media tools. Given this growing health care engagement, it is important for health care professionals to have the knowledge and skills to evaluate and recommend appropriate digital tools. The purpose of this article is to identify and review criteria or instruments that can be used to evaluate mobile apps and social media. The analysis will review current literature as well as literature designed by professional health care organizations. This review will facilitate health care professionals’ assessment of mobile apps and social media tools that may be pertinent to their patient population. The review will also highlight strategies which a health care system can use to provide guidance in recommending mobile apps and social media tools for their patients, families, and caregivers. Full article
Open Access Article
digiMe: An Online Portal to Support Connectivity through E-Learning in Medical Education
Informatics 2017, 4(3), 30; doi:10.3390/informatics4030030
Abstract
Connectivity is intrinsic to all aspects of our life today, be it political, economic, technological, scientific, or personal. Higher education is also transcending the previous paradigm of technology-enabled content delivery and e-learning, with a new emphasis on connectivity, enabling participants to exchange knowledge and collaborate to meet educational goals. In this study, a website supported by social media technology, digiMe, was developed and evaluated at the School of Medicine of one Australian university. Connectivity to other medical learners and health professionals is intrinsic to digiMe. This paper reports the functionalities of this website, results of a post-intervention evaluative survey, and statistics of website usage generated from Google Analytics. The results revealed more active adoption of, and a more positive attitude towards, digiMe among Year 4 students compared to Year 5 students. The participants showed a desire for access to a recommended collection of apps, such as those offered through digiMe. However, many participants did not use digiMe beyond the initial introduction to it. digiMe demonstrated its potential in raising awareness of web and mobile apps useful for enhancing connectivity, although it needs to be introduced to students in earlier years of their medical education to achieve a higher impact on their learning. Full article

Open Access Article
Scalable Interactive Visualization for Connectomics
Informatics 2017, 4(3), 29; doi:10.3390/informatics4030029
Abstract
Connectomics has recently begun to image brain tissue at nanometer resolution, which produces petabytes of data. This data must be aligned, labeled, proofread, and formed into graphs, and each step of this process requires visualization for human verification. As such, we present the BUTTERFLY middleware, a scalable platform that can handle massive data for interactive visualization in connectomics. Our platform outputs image and geometry data suitable for hardware-accelerated rendering, and abstracts low-level data wrangling to enable faster development of new visualizations. We demonstrate scalability and extendability with a series of open source Web-based applications for every step of the typical connectomics workflow: data management and storage, informative queries, 2D and 3D visualizations, interactive editing, and graph-based analysis. We report design choices for all developed applications and describe typical scenarios of isolated and combined use in everyday connectomics research. In addition, we measure and optimize rendering throughput—from storage to display—in quantitative experiments. Finally, we share insights, experiences, and recommendations for creating an open source data management and interactive visualization platform for connectomics. Full article

Open Access Article
Sampling and Estimation of Pairwise Similarity in Spatio-Temporal Data Based on Neural Networks
Informatics 2017, 4(3), 27; doi:10.3390/informatics4030027
Abstract
Increasingly fast computing systems for simulations and high-accuracy measurement techniques drive the generation of time-dependent volumetric data sets with high resolution in both time and space. To gain insights from this spatio-temporal data, the computation and direct visualization of pairwise distances between time steps not only supports interactive user exploration, but also drives automatic analysis techniques like the generation of a meaningful static overview visualization, the identification of rare events, or the visual analysis of recurrent processes. However, the computation of pairwise differences between all time steps is prohibitively expensive for large-scale data, not only due to the significant cost of computing expressive distances between high-resolution spatial data, but in particular owing to the large number of distance computations (O(|T|^2), with |T| being the number of time steps). Addressing this issue, we present and evaluate different strategies for the progressive computation of similarity information in a time series, as well as an approach for estimating distance information that has not been determined so far. In particular, we investigate and analyze the utility of using neural networks for estimating pairwise distances. On this basis, our approach automatically determines the sampling strategy yielding the best result in combination with trained networks for estimation. We evaluate our approach with a variety of time-dependent 2D and 3D data from simulations and measurements as well as artificially generated data, and compare it against an alternative technique. Finally, we discuss prospects and limitations, and outline different directions for improvement in future work. Full article
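To make the quadratic cost in the abstract above concrete, the following sketch counts the distinct pairs among |T| time steps and computes distances only for a regularly strided subset. It is an illustration under stated assumptions, not the paper's method: the Euclidean distance, the fixed-stride sampling, and the names num_pairs, l2_distance and pairwise_distances are all chosen for this example, and the neural-network estimation of the skipped pairs is not shown.

```python
import itertools
import math


def num_pairs(n_steps):
    """Number of distinct unordered pairs among n_steps time steps."""
    return n_steps * (n_steps - 1) // 2


def l2_distance(a, b):
    """Euclidean distance between two equally sized, flattened time steps."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pairwise_distances(steps, stride=1):
    """Distances between every stride-th time step, keyed by (i, j), i < j.

    stride=1 evaluates all pairs; a larger stride is one simple
    (assumed) way to evaluate only a subset of them.
    """
    indices = range(0, len(steps), stride)
    return {
        (i, j): l2_distance(steps[i], steps[j])
        for i, j in itertools.combinations(indices, 2)
    }


# Tiny made-up series: four time steps with two values each.
series = [[0.0, 0.0], [3.0, 4.0], [6.0, 8.0], [9.0, 12.0]]

full = pairwise_distances(series)              # all 6 pairs
coarse = pairwise_distances(series, stride=2)  # only the pair (0, 2)
print(len(full), len(coarse), num_pairs(1000))
```

Doubling the stride roughly quarters the number of distance evaluations, which is why subsampling time steps is an effective first lever before any estimation of the missing pairs.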

Open Access Review
Web Apps Come of Age for Molecular Sciences
Informatics 2017, 4(3), 28; doi:10.3390/informatics4030028
Abstract
Whereas server-side programs are essential to maintain databases and run data analysis pipelines and simulations, client-side web-based computing tools are also important as they allow users to access, visualize and analyze the content delivered to their devices on-the-fly and interactively. This article reviews the best-established tools for in-browser plugin-less programming, including JavaScript as used in HTML5 as well as related web technologies. Through examples based on JavaScript libraries, web applets, and even full web apps, either alone or coupled to each other, the article puts the spotlight on the potential of these technologies for carrying out numerical calculations, text processing and mining, retrieval and analysis of data through queries to online databases and web services, and effective visualization of data including 3D visualization and even virtual and augmented reality; all of them in the browser at relatively low programming effort, with applications in cheminformatics, structural biology, biophysics, and genomics, among other molecular sciences. Full article

Open Access Article
Multidimensional Data Exploration by Explicitly Controlled Animation
Informatics 2017, 4(3), 26; doi:10.3390/informatics4030026
Abstract
Understanding large multidimensional datasets is one of the most challenging problems in visual data exploration. One key challenge that increases the size of the exploration space is the number of views that one can generate from a single dataset, based on the use of multiple parameter values and exploration paths. Often, no such single view contains all needed insights. The question thus arises of how we can efficiently combine insights from multiple views of a dataset. We propose a set of techniques that considerably reduce the exploration effort for such situations, based on the explicit depiction of the view space, using a small multiple metaphor. We leverage this view space by offering interactive techniques that enable users to explicitly create, visualize, and follow their exploration path. This way, partial insights obtained from each view can be efficiently and effectively combined. We demonstrate our approach by applications using real-world datasets from air traffic control, software maintenance, and machine learning. Full article