Designed especially for neurobiologists, FluoRender is an interactive tool for multi-channel fluorescence microscopy data visualization and analysis.

BrainStimulator is a set of networks that are used in SCIRun to perform simulations of brain stimulation such as transcranial direct current stimulation (tDCS) and magnetic transcranial stimulation (TMS).

Developing software tools for science has always been a central vision of the SCI Institute.

Visualization

Visualization, sometimes referred to as visual data analysis, uses the graphical representation of data as a means of gaining understanding and insight into the data. Visualization research at SCI has focused on applications spanning computational fluid dynamics, medical imaging and analysis, biomedical data analysis, healthcare data analysis, weather data analysis, poetry, network and graph analysis, financial data analysis, etc.

Research involves novel algorithm and technique development to building tools and systems that assist in the comprehension of massive amounts of (scientific) data. We also research the process of creating successful visualizations.

We strongly believe in the role of interactivity in visual data analysis. Therefore, much of our research is concerned with creating visualizations that are intuitive to interact with and also render at interactive rates.

Visualization at SCI includes the academic subfields of Scientific Visualization, Information Visualization and Visual Analytics.

Charles Hansen

Volume Rendering
Ray Tracing
Graphics

Valerio Pascucci

Topological Methods
Data Streaming
Big Data

Chris Johnson

Scalar, Vector, and
Tensor Field Visualization,
Uncertainty Visualization

Mike Kirby

Uncertainty Visualization

Ross Whitaker

Topological Methods
Uncertainty Visualization

Alex Lex

Information Visualization

Bei Wang

Information Visualization
Scientific Visualization
Topological Data Analysis

Centers and Labs:

Funded Research Projects:

SCALE MoDL: Advancing Theoretical Minimax Deep Learning: Optimization, Resilience, and Interpretability

Bei Wang
The past decade has witnessed the great success of deep learning in broad societal and commercial applications. However, conventional deep learning relies on fitting data with neural networks, which is known to produce models that lack resilience. The next-generation deep learning paradigm needs to deliver resilient models that promote robustness to malicious attacks, fairness among users, and privacy preservation. In this project, the investigators will collaboratively develop a comprehensive minimax learning theory that advances the fundamental understanding of minimax deep learning from the perspectives of optimization, resilience, and interpretability.

Enabling Reproducibility of Interactive Visual Data Analysis

Alex Lex
Reproducibility and justifiability are widely recognized as critical aspects of data-driven decision making in fields as varied as scientific research, business, healthcare, or intelligence analysis. This project is concerned with enabling reproducibility and justifiability of decisions in the data analysis process, specifically as it relates to visual data analysis. Visualization is an important tool for discovery, yet decisions made by humans based on visualizations of data are difficult to capture and to justify. This project will develop methods to justify, communicate, and audit decisions made based on visual analysis. This, in turn will lead to better outcomes, achieved with less effort and cost. The increasing use of visual analysis tools for decision making will make data analysis accessible to a broad variety of people, as visual analysis tools are generally easier to use than scripting languages and do not require extensive computational and statistical training. This research and its related activities increase accessibility and enhance the data analysis infrastructure for research and education.

To achieve these goals, this research will develop a framework for making visual analysis sessions not only reproducible but also reusable. The approach is based on tracking semantically meaningful provenance data during an interactive visual analysis session. Once a discovery is made, analysts can use this history to curate a succinct analysis story, adding justifications and explanations to make their analysis reproducible by others. Using a semi-automatic process, analysts will be able to make their actions data-aware, so that their analysis processes become robust to changes, such as updates in the data. A second contribution of the proposed work is the integration of visual analysis into computational analysis processes. While visualization is commonly used to present computational analysis results, the results of a visual analysis session are rarely used to feed into further computational processes. The techniques developed in this project will allow analysts to feed analysis results (selections, aggregations, filters, etc.) back into a computational environment. This will make it possible to use interactive visualization at any point in the data analysis process while maintaining reproducibility and enabling reuse. The expected results include new methods to capture user intent, create data stories from analysis processes, and to integrate computational and visual data analysis, leveraging the strength of both, human abilities and computational power. The results will be disseminated in publications and in the form of open source software, and accessible via the project website (http://vdl.sci.utah.edu/projects/2018-nsf-reproducibility/).

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Reproducible Visual Analysis of Multivariate Networks with Multinet

Miriah Meyer, Bryan Jones, Alexander Lex
Multivariate networks -- datasets that link together entities that are associated with multiple different variables -- are a critical data representation for a range of high-impact problems, from understanding how our bodies work to uncovering how social media influences society. These data representations are a rich and complex reflection of the multifaceted relationships that exist in the world. Reasoning about a problem using a multivariate network allows an analyst to ask questions beyond those about explicit connectivity alone: Do groups of social-media influencers have similar backgrounds or experiences? Do species that co-evolve live in similar climates? What patterns of cell-types support different types of brain functions? Questions like these require understanding patterns and trends about entities with respect to both their attributes and their connectivity, leading to inferences about relationships beyond the initial network structure. As data continues to become an increasingly important driver of scientific discovery, datasets of networks have also become increasingly complex. These networks capture information about relationships between entities as well as attributes of the entities and the connections. Tools used in practice today provide very limited support for reasoning about networks and are also limited in the how users can interact with them. This lack of support leaves analysts and scientists to piece together workflows using separate tools, and significant amounts of programming, especially in the data preparation step. This project aims fill this critical gap in the existing cyber-infrastructure ecosystem for reasoning about multivariate networks by developing MultiNet, a robust, flexible, secure, and sustainable open-source visual analysis system.

MultiNet aims to change the landscape of visual analysis capabilities for reasoning about and analyzing multivariate networks. The web-based tool, along with an underlying plug-in-based framework, will support three core capabilities: (1) interactive, task-driven visualization of both the connectivity and attributes of networks, (2) reshaping the underlying network structure to bring the network into a shape that is well suited to address analysis questions, and (3) leveraging provenance data to support reproducibility, communication, and integration in computational workflows. These capabilities will allow scientists to ask new classes of questions about network datasets, and lead to insights about a wide range of pressing topics. To meet this goal, we will ground the design of MultiNet in four deeply collaborative case studies with domain scientists in biology, neuroscience, sociology, and geology.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Visualizing Robust Features in Vector and Tensor Fields

Bei Wang
Vector and tensor fields provide a powerful language to describe physical phenomena in many scientific applications. In atmospheric sciences, vectors are used to represent air movements with speed and directions and to capture typical and atypical atmospheric conditions. In materials science, stress and strain tensors are used to specify the behaviors of material bodies experiencing deformations and to facilitate the study of material strength. The main objective of this project is to define and quantify robust features in vector and tensor fields and to derive scientifically meaningful visualization for knowledge discovery. Robust features are objects, structures, or regions of interest that are stable under small perturbations of the data that arise from measurement noise, numerical instability or simulation uncertainty. Robust features are defined and evaluated via close collaborations with domain scientists to help them discriminate spurious from essential structures in the data. In materials science, the extraction of robust features in stress tensor fields will help the materials scientists better characterize and predict 3D cracking for manufacturing stronger materials. In neuroscience, quantifying the robustness of degenerate elements in brain imaging will offer new metrics and visualization in characterizing tissue microstructure for disease diagnostics. In bioengineering, robust vortex extraction and tracking of 3D conduction velocity fields in the heart will help bioengineers develop new metrics that detect and characterize ischemic stress associated with a heart attack. In atmospheric sciences, extracting and visualizing robust features in wind data will help the atmospheric scientists establish situation awareness of hazardous weather conditions such as wildfires and to provide wildfire weather forecasting and resource planning for firefighting personnel. This project will also provide a unique environment for multidisciplinary activities and training opportunities for students in integrating visualization with scientific applications.

This project will establish a new approach to feature-based visualization with three interconnected aims. First, it will derive novel mathematical formulations of robust features for vector and tensor fields and their ensembles. Second, it will develop new robustness-driven algorithms in feature extraction, tracking, simplification, visual representation, and uncertainty visualization. Third, it will apply and evaluate the proposed framework via close collaborations with scientists in four high-impact application areas: materials science, neuroscience, bioengineering, and atmospheric sciences. Using simulated micro-mechanical fields in an uncracked polycrystal, the project will integrate robust features with visualization to improve the interpretability of micro-mechanical fields and the prediction of fatigue-failure surfaces. Using diffusion tensor imaging (DTI) from the Human Connectome Project, the project will investigate quantifiable characteristics of crossing fibers as part of a long-term goal for deep brain stimulator placement. Using 3D conduction velocity generated in volumes of swine and canine tissues, the project will generate feature-based signatures from vortex stability and evolution and use them, in the long term, for disease diagnostics and medical intervention. Using ensemble datasets generated from the High-Resolution Rapid Refresh Model (HRRR), the project will use robust features in the visualization and statistical analysis of atmospheric models to identify atypical atmospheric conditions for wildfire weather assessment. The research results will be instantiated by a collection of research papers and open-source software tools targeting the communities of collaborating scientists and the large research community. These software tools will be made available via GitHub under MIT or BSD licenses.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

EAGER: Understanding and Mitigating Misinformation in Visualizations on Social Media

Alexander Lex
In a time of crisis, such as during a hurricane or a global pandemic, social media is an important source of information for the general population. In these scenarios, data visualizations are often used to convey information that is critical for decision making by individuals. For example, a visualization of the path of a hurricane can inform the affected population about the need to prepare or evacuate; while a visualization about the prevalence of a disease in a certain area can inform personal choices, such as limiting interactions with others during a relevant time period. Visualizations, however, can be flawed, which can lead to misinterpretation of the data, and, in a crisis, lead to decisions with negative consequences. This project seeks to identify aspects of visualizations that makes them widely shared, identify flaws a visualization might have, and warn social media users about them. Ultimately, this project can lead to better responses to a crisis by the general population, and contribute to improving visualization literacy. Finally, this project will also enable the training of two graduate students, provide opportunities for undergraduate research, and curate material that can be leveraged by educators teaching about visualization design.

These goals will be achieved by applying existing and novel methods, such as topic modeling and calculating measures of social attention, to three large dataset of social media posts related to recent crisis. Using a qualitative coding approach, a taxonomy of design problems will be developed. This taxonomy will be used to label a large dataset. Finally, a prototype intervention in the form of a plug-in that warns of problematic visualizations, but also enables users to classify problems with visualizations they encounter, will be developed. The dataset and the annotations compiled in the course of this project will be shared publicly. The software created will be released under a permissive, non-viral open source license.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

FluoRender: Visualization-Based and Interactive Analysis for Multi-Channel Microscopy Data

Chuck Hansen
FluoRender is a software package for visualizing and analyzing 3D and 4D (3D over time) fluorescence microscopy data. This project will serve the needs of biologists utilizing confocal microscopy for understanding cell development in many organisms and addresses the big-data problem from the massive increase of imaging data from modern high-resolution fluorescence microscopes.

Specific Aim 1 : Visualization of an extended number of volume channels: FluoRender will be enhanced with the multichannel visualization capability by simultaneously supporting several tens to hundreds of channels, which can be acquired from multispectral imaging devices or by registering data of multiple scans. FluoRender will take advantage of the latest volume rendering techniques to visualize significantly improved signal intensity detail compared to pseudo-surfaces.

Specific Aim 2 : Interactive comparison and organization of volume channels: A package of measures will be implemented in FluoRender for directly comparing volume channels. Leveraging the OpenCL programming interface, shape comparisons will be performed interactively on graphics hardware, allowing compound measures for complex morphology as well as immediate visual feedback via multichannel visualization. Interactive comparison will further enable the development of functions for semiautomatic channel organization and multichannel colocalization analysis.

Specific Aim 3 : 4D tracking of structures with irregular and changing shapes: Tracking irregularly shaped and shape-changing structures will substantially expand FluoRender's application for developmental and morphological studies of intracellular organelles, cells, and tissues. This will include a comprehensive tracking system that integrates different modules and allows them to work in an iterative and integrated environment, allowing user-guided, progressive refinement of the segmentation and tracking results.

Specific Aim 4. Fully hardware-accelerated and customizable computing modules: FluoRender will be restructures using compute modules based on the OpenCL standard, which provides not only hardware-accelerated execution speed, but also convenience for customization and reuse. Computing modules will be integrated with visualization features, enabling interactive and visualization-centered analysis. Users will also be able to reorganize and build modules to customize specific workflows for great adaptability.

Public Health Relevance
FluoRender is a software package for visualizing and analyzing 3D and 4D (3D over time) fluorescence microscopy data. This project will serve the needs of biologists utilizing confocal microscopy for understanding cell development in many organisms and addresses the big-data problem from the massive increase of imaging data from modern high-resolution fluorescence microscopes.

CPS: Synergy: A Layered Framework of Sensors, Models, Land-Use Information and Citizens for Understanding Air Quality in Urban Environments

Miriah Meyer, Ross Whitaker, Kerry Kelly, Pierre-Emmanuel Gaillardon
Poor air quality has been linked to not just adverse health effects such as increased incidence of cardiac arrhythmia, lung cancer, heart disease, and mortality, but also to the vitality of a region’s economy. These issues are particularly important in cities such as Salt Lake City (SLC), where topography, climate, and urban expansion combine to create some of the worst air quality episodes in the country. Cities like SLC currently rely on small numbers of expensive sensors placed across a large geographic area to measure air quality, making local, neighborhood-level measurements impossible to determine. Meanwhile, new commodity technologies are leading to fine-grained, community-based strategies for measuring and communicating air quality. Leveraging both of these approaches, this project will develop and deploy a dense, distributed, and dynamic air quality cyber-physical framework -- focusing on fine particulate matter and using SLC as an urban testbed -- to produce neighborhood-level estimates of air quality. The framework includes a network of low-cost sensors, hosted and maintained through a citizen science effort and maker-kit approach.

This research will result in novel developments in three areas: (i) sensor development that focuses on dramatically reducing cost and a movement toward cheap, wearable, passive sensors; (ii) computational modeling that combines heterogeneous sensor measurements with information about weather, topography, and land use patterns; and (iii) visualization interface design that communicates air quality estimates over space and time, coupled with related uncertainty measurements. Each of these areas requires a multidisciplinary approach that integrates existing and novel insights about sensor networks, computational modeling, and sense-making of data, as well as leveraging an engaged and connected community of residents through citizen science.

SBIR Phase II Immediate Delivery of Massive Aerial Imagery to Farmers and Crop Consultants

Valerio Pascucci, Amy Gooch
This Small Business Innovation Research (SBIR) Phase II project will accelerate the adoption of data intensive precision agriculture, increasing yields while decreasing farm inputs such as fertilizers and pesticides. This project removes the software bottleneck (time and labor) in processing large aerial surveys taken by Unmanned Aerial Systems, enabling a cost-effective and timely process to deliver actionable information to farmers. Using frequent high-quality aerial scans, farmers may optimize the use of fertilizers and more finely control the amount of pesticides and herbicides necessary to increase crop yield. Furthermore, farmers mitigate costs and losses by being able to spot problem areas, minimize the spread of plant diseases, and identify issues such as standing water, irrigation malfunctions, and persistent automated machinery errors in planting or cultivation. This project provides special benefit for rural customers having inadequate internet infrastructure by eliminating the need to upload massive imagery to the cloud for processing. The technology is part of a broad initiative in agriculture addressing the need for large increases in food production by 2050 in response to the projected growth of the world’s population to over 9 Billion people.

This project will continue development of algorithms for on-the-fly orthorectification, stitching, and normalization of aerial image mosaics and their deployment in an easy-to-use software prototype. The Phase I already demonstrated industry-leading speeds for such image processing. The technology behind this research project is designed from the ground up to process massive data with less memory and increased speed relative to other approaches, enabled by a proprietary streaming image representation, that allows multichannel gigapixel and terapixel images to be treated as ordinary images. This Phase II supports new extensions to the software that simplify and accelerate delivering a stitched and analyzed map, such as prioritizing computation in regions of the image that a customer is exploring. This would effectively eliminate the delay between image acquisition on unmanned aerial vehicles and when it can be used. Crop consultants have identified this as a transformative capability, as it enables ground-truthing information derived from aerial imagery in the same field visit, saving time and labor. The performance gains in compute-limited environments supported by this project are a key link between new capabilities to gather information and a farmer’s ability to utilize it to increase productivity while reducing costs.

Topology-Preserving Data Sketching for Scientific Visualization

Bei Wang
We are experiencing an information overload from streams of data that arise from scientific instruments and simulations. For example, material scientists use molecular dynamics (MD) simulations to study how fluids (such as gas, oil, and water) interact with heterogeneous porous solids (such as ceramics, cement, and rock) to improve transport phenomena within porous materials, which play critical roles in our energy sector. Such simulations generate large, time-varying, and complex forms of data under different physical and chemical conditions. Keeping track of interesting phenomena and applying appropriate actions (such as storage, analysis, and visualization) while the simulation is running is necessary but challenging. To address this challenge, the goal is no longer to capture and store observations or simulation in detail, but rather to process data efficiently and approximately in order to create a summary - a sketch - which allows queries over large volumes of data to be answered quickly.

The objective of this research is to conduct a systematic study of topology-preserving data sketching techniques to improve visual exploration and understanding of large scientific data. The project will employ topological sketches, that is, compressed representations of the full data that preserve their important structural properties, to support analysis and visualization as the data are generated. Our proposed solution transforms data sketching ideas from statistics, geometry, and linear algebra to develop new topological sketches of complex data. Such sketches will exploit the high spatial resolution and temporal fidelity of in situ data in an intelligent and scalable way. They will reduce data in situ while preserving its structural properties, and subsequently support interactive data exploration. In addition, topological triggers will be integrated into an adaptive workflow to support anomaly detection, computational steering, and decision optimization. The multidisciplinary nature of the proposed work will be broadly applicable in many scientific areas, including applications in computational fluid dynamics and materials science.

Novel 3d Experiments and Simulations Combined with Genetic Optimization for Accelerated Design of Metallic Foams

Valerio Pascucci
Open-cell metallic foams are an exciting class of structural materials that comprise a network of interconnected metallic ligaments, resulting in an interesting foam architecture. These low-density materials have garnered much attention over the past two decades based on their recognized potential for use in multi-functional applications. For example, in addition to serving as light-weight, load-bearing structures, open-cell metallic foams have the potential to serve concurrently as electrodes for energy-storage devices, as hosts for newly generated bone and blood vessels in biomedical implants, or as impact absorbers and noise insulators for advanced high-speed ground transportation. Despite their potential, the widespread deployment of open-cell metallic foams for a broader range of multi-functional applications remains hampered by inefficient, trial-and-error manufacturing approaches. This Designing Materials to Revolutionize and Engineer our Future (DMREF) Grant Opportunities for Academic Liaison with Industry (GOALI) award supports a joint academic-industry research effort to enable more efficient and intelligent design of open-cell metallic foams, and to achieve precise control over their performance for targeted applications. The results will provide dramatic improvements for the industry by increasing both the manufacturing efficiency and the tailorability of the foams, which will help to expand deployment of the foams throughout the energy, defense, biomedical, aerospace, and automotive industries. The research team will host outreach activities to expose students in K-12, undergraduate, and graduate school to this multi-disciplinary STEM research.

This DMREF GOALI award supports research to enable an accelerated and performance-based design paradigm for open-cell metallic foams through the integration of emergent methods in 3D materials characterization with multi-scale modeling and Bayesian optimization. The new design paradigm will be made possible through the discovery of process-structure-property relationships in the foams. The specific objectives include: experimentally modifying manufacturing parameters to produce variants of open-cell metallic foams; performing 3D synchrotron-based crystal-orientation measurements and in-situ X-ray computed tomography experiments to gain unprecedented insight into the hierarchical structure and multi-scale deformation mechanisms of the foam; using high-fidelity, multi-scale (grain-to-continuum) finite-element modeling to investigate micromechanical behavior and predict performance of the as-manufactured foams; conducting virtual tests on synthetic-foam variants to further populate a metallic-foam design space; and using Bayesian optimization on the simulation-based results to enable selection of optimal hierarchical structures (i.e. topology and crystallography) for targeted performance metrics. The research will be a first to decouple the effects of ligament topology and underlying crystal structure on micromechanical behavior of open-cell metallic foams (including microbuckling, local accumulation of slip, and distribution of crack-nucleation sites), which is postulated to influence its performance.

A Scalable Framework for Visual Exploration and Hypotheses Extraction of Phenomics Data

Bei Wang
Understanding how gene by environment interactions result in specific phenotypes is a core goal of modern biology and has real-world impacts on such things as crop management. Developing and managing successful crop practices is a goal that is fundamentally tied to our national food security. By applying novel computational visual analytical methods, this project seeks to identify and unravel the complex web of interactions linking genotypes, environments and phenotypes. These methods will first need to be designed and developed into usable software applications that can handle large volumes of crop phenomics data. High-throughput sensing technologies collect large volumes of field data for many plant traits, such as flowering time, related to crop development and production. The maize cultivars used here come from multiple genotypes that have been grown under a variety of environmental conditions, in order to give the widest range of conditions for understanding the interactions. The resulting data sets are growing quickly, both in size and complexity, but the analytical tools needed to extract knowledge and catalyze scientific discoveries have significantly lagged behind. The methodologies to be developed in this project represent a systematic attempt at bridging this rapidly widening divide. The project is inherently interdisciplinary, involving close research partnerships among computer scientists, plant scientists, and mathematicians. The research outcomes will be tightly integrated with education using a multipronged approach that includes, among others, postdoctoral and student training (graduates and undergraduates), curriculum development for a new campus-wide interdisciplinary undergraduate degree in Data Analytics, conference tutorials for training phenomics data practitioners, and contribution to the recruitment and retention of underrepresented minorities (particularly women) in STEM fields through the Pacific Northwest Louis Stokes Alliance for Minority Participation.

This project will lead to the design and development of a new, scalable, visual analytics platform suitable for hypothesis extraction and refinement from complex phenomics data sets. Focus on hypothesis extraction is critical in the context of phenomics data sets because much of the high-throughput sensing data being generated in crop fields are generated in the absence of specifically formulated hypotheses. Extracting plausible hypotheses from the data represents an important but tedious task. To this end, this project will apply and develop new capabilities using emerging advanced algorithmic principles, particularly from the branch of mathematics called algebraic topology that studies shapes and structure of complex data. The research objectives are three-fold. First, the project will employ and extend emerging algorithmic techniques from algebraic topology to decode the structure of large, complex phenomics data. Second, an interactive visual analytic platform will be developed to facilitate knowledge discovery using the extracted topological structures. Lastly, the quality and validity of a new visual analytic platform designed by this team will be tested using real-world maize data sets as well as simulated inputs as testbeds. The developed framework will encode functions for scientists to delineate hypotheses of three kinds: i) genetic characterization of single complex traits; ii) genetic characterization of multiple traits that share potentially pleiotropic effects; and iii) decoding and detailed characterization of genotype-by-environmental interactions, in particular, through a collaborative pilot study of maize flowering and growth traits. The expected significance of the proposed work is that biologists will be able to extract different types of testable hypotheses from plant phenomics data sets by employing a new class of visual analytic tools, and thus obtain a deeper understanding of the interactions among genotypes, environments and phenotypes. The project is potentially transformative in two ways: i) it will introduce advanced mathematical and computational principles into mainstream phenomic data analysis; and ii) it will usher in a new era where biologists spearhead data-driven hypothesis extraction and discovery with the aid of interactive, informative, and intuitive tools. The project will have a direct impact on the state of software in phenomics for fundamental data-driven discovery. To facilitate broader community adoption, the project will integrate the tools into the CyVerse Institute, and to a community phenomics software outlet. It will also lead to the development of automated scientific workflows. Project website: http://tdaphenomics.eecs.wsu.edu/.

COVID - RAPID: Building a Visual Consensus Model of the SARS-CoV-2 Life Cycle

Janet Iwasa, Miriah Meyer
The COVID-19 epidemic has motivated hundreds (if not thousands) of biological researchers around the globe to redirect their research efforts towards the understanding of SARS-CoV-2. This is leading to an explosion of data and it will be essential to find ways to rapidly digest and integrate new information into a context that facilitates consensus building in the research community. How do researchers and the broader community stay abreast of this flood of information? And how can we quickly move towards building a consensus model of the SARS-CoV-2 life cycle that builds on this explosive body of scientific data and expertise? This work proposes to take a novel and intuitive approach to facilitate scientific discourse and dissemination through the development of: (1) detailed molecular 3-D depictions that put a diverse dataset into the context of the SARS-CoV-2 life cycle, and; (2) provide for annotation tools to be used by researchers to explore and capture scientific discussions that will speed up consensus building to promote a mechanistic understanding of how this virus works. If successful, the work will reduce the time of consensus building from years to months. In addition, a graduate student and postdoc will receive training at the intersection of biological and computer sciences.

Specifically, researchers will work with an international group of SARS-CoV-2 experts to develop detailed and accurate visualizations of all stages of the viral life cycle including cellular entry, RNA replication and transcription, and viral assembly and egress with known energy states, rates, and spatial accuracy. These 3-D visualizations, which will be made freely available online, will be used to stimulate discussions within the scientific community, and will be iteratively updated based on community feedback and new data. To facilitate consensus building, annotation tools will be developed to interactively describe the data used to generate the visualizations and will also mediate and capture scientific discourse surrounding the various molecular mechanisms involved in viral infection. This project will rapidly produce a rich and publicly accessible collection of knowledge about SARS-CoV-2 biology for the global community.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

OpenSpace: An Engine for Dynamic Visualization of Earth and Space Science for Informal Education and Beyond

Chuck Hansen
The American Museum of Natural History (AMNH), in collaboration with informal science institutions (ISI), NASA mission teams and Subject Matter Experts (SME), and academic partners, seeks support for a five-year project to enable STEM education and improve U.S. scientific literacy by engaging a broad spectrum of the American public and STEM learners in cutting-edge NASA science and engineering content.

This project will develop an open source software, called OpenSpace, for visualizing NASA astrophysics, heliophysics, planetary science, and Earth science mission engineering activities and science results for the general public, students, teachers, and citizen scientists everywhere. The project will develop and widely disseminate OpenSpace; create innovative and networked programs with ISI partners; produce educational resources for middle and high school teachers and students; and establish robust partnerships with NASA SMD missions, ISIs, and visualization research centers.

The project is based on the success of pilot efforts to visualize the New Horizons mission and heliophysics and space weather simulation data generated by NASA Goddard’s Community Coordinated Modeling Center. It builds on AMNH’s expertise in science visualization and its record of success in partnering with NASA to develop innovative programming, exhibitions, and Space Shows that engage, inspire, and educate students, teachers, and learners of all ages.

Drawing together a highly qualified and exceptionally talented team of scientists, educators, software engineers, and visualization specialists, the project’s aim is to build a pipeline for transmitting visualized science content from across NASA SMD divisions to ISIs, secondary school classrooms, and the public.

To do so, the project proposes the following objectives:

Develop OpenSpace into a robust and flexible interactive visualization software that supports the presentation of dynamic data sets and that is easily updated for the presentation of current science.
Form a network of ISIs to inform the development of OpenSpace and develop associated programming to engage and educate diverse audiences.
Disseminate OpenSpace via the web to individual users, including teachers as a key audience, with resources for leveraging it as an educational tool.

Project outcomes include:

The establishment of a pipeline connecting NASA SMD content and SMEs with ISIs, secondary school classrooms, and the public.
The development of a new and powerful educational tool for the visualization of a wide range of NASA SMD mission activities and data products.
Enhanced understanding and engagement in STEM among youth, informal and formal educators, and the general public.

Project objectives, activities, and outcomes are closely aligned with, and aim to fulfill, the SMD science education objectives of enabling STEM education, improving U.S. scientific literacy, and advancing national education goals of increasing and sustaining youth and public engagement in STEM and leveraging efforts through partnerships.

Because OpenSpace will be open source, it will be freely accessible to users. It is designed to be compatible with multi-video channel cluster operations for high-resolution wall displays and planetarium domes, as well as for single-channel polar rendering fisheye projections and flat screens, in 2D and 3D. A WebGL version will make it possible for anyone with Internet access to explore OpenSpace. Another core design principle of this project is the ability to network across the Internet to synchronize displays in different locations, creating opportunities for shared experiences of high profile NASA content, including live events. This open source project will have a life far beyond the award period, as it will provide science and education communities access to the source code to modify, enhance, and extend its functionality to best serve audiences in the future.

Extracting the Full Information Content of Astrophysical Data Cubes

Bei Wang
An IFU (Integral Field Unit Spectrometer) allows one to take a high-resolution spectrum at multiple physical locations within an external target. The signal from an astronomical target is distributed into a large number of spaxels (spatial pixels), each with noise from the sky and detectors, and a greatly varying signal to noise ratio across the bundle. IFU bundle technique gives rise to 3-dimensional astrophysical data cubes (two spatial directions and one frequency direction) that require advanced analysis techniques to extract their salient features. In many cases the complex kinematic structure of features of interest further complicates the problem. Furthermore, it is intrinsically difficult to visualize such data and common analysis techniques often involve slicing the data cube along a particular axis, either at a fixed frequency or a fixed spatial location.

A common type of data from IFU bundle technique is the Mapping Nearby Galaxies at APO (MaNGA) survey, which is part of the Sloan Digital Sky Survey IV (SDSS-IV). PI Phillips and PI Rosen have been working to analyze similar data cubes taken at radio frequencies with the ALMA telescope in Chile (see http://alma-tda.cspaul.com). They have been using sophisticated mathematical techniques known as topological data analysis, in particular the contour tree, in order to extract features and remove noise for visualizing data cubes very similar to the ones arise from IFU.

Objective
We would like to apply advanced data analysis and visualization techniques, in particular, those from topological data analysis, to data observed at UV, optical and infrared wavelengths, in order to extract features that are currently inaccessible. In particular, we would like to start by studying the SDSS-IV MaNGA dataset, to which Carnegie Institution for Science and the University of Utah (where the MaNGA reduction and analysis pipelines are run via the Center for High Performance Computing) have full access as Institutional members (the SDSS Data Scientist, Prof. Joel Brownstein of the University of Utah is a PI on this project).

Furthermore, we will explore the applicability of such techniques to other similar datasets that have been acquired using other IFU facilities.

Topological Analysis for Energetic Materials Characterization

Valerio Pascucci
This statement of work supports ongoing efforts towards improved analysis of characterization and surveillance data of energetic materials. The goals are to: 1) use topological segmentations to analyze microstructural changes under aging; 2) explore extending the analysis tools to characterize fine-prill materials; 3) develop techniques to quantify permeable surface area of a lower-density system; and 4) extract age-trendable features from2D-surface profile data.

Tasks

1. Analyze microstructural changes under aging: At various Aging points (in time-temperature space):

Determine matching scales and simplification levels to create best matching segmentations for each dataset
Develop techniques to affinely align pre- & post-aged data sets for maximal correspondence
Use per-grain matching to analyze material changes over time

2. Explore extending the analysis tools to the characterization of fine-prill materials:

In previous years the Utah technology could successfully analyze X-ray CT data for coarser-prill HE materials. Explore the effectiveness of such technology in performing similar analysis on X-ray CT data for fine-prill systems.

3. Develop techniques to quantify permeable surface area of lower-density systems:

The topological segmentation theory could be used to quantify the permeable surface area of lower-density (e.g., porous-powder) systems, and to compute the gas-flow rate through such a specimen under a given pressure-gradient. CONTINGENCY: Availability of high-quality micro-CT data.

4. Extract age-trendable features fromsurface profilometry data

Analyze 2D height-map data from pellet surfaces (measured using a surface profilometer) and device quantitative features that can be used to track age-related changes in material morphology and performance.

Advanced Visualization of Silent Error Propagation in HPC Applications

Valerio Pascucci
High Performance Computing (HPC) systems contain increasingly large numbers of components. This trend, combined with practical limitations on component reliability, makes HPC systems vulnerable to a wide range of faults. These faults degrade systems efficiency and even threaten the correctness of application results. The problem is expected to grow even more significant for Exascale systems. Designing resilient software to run efficiently on such hardware is challenging, and uncertainty about how failures affect programs only complicates the problem.

Disruptions to the micro‐architectural state of hardware components (e.g., caches, reorder buffers or pipeline registers), may cause these components to crash or compute erroneous results. These errors then propagate through layers of the software stack, including the runtime system, support libraries, and application logic. Local memory access to erroneous results can easily propagate the effects of errors across cores; and the remote memory access on modern networks propagates errors across nodes. The reordered memory accesses in use by memory systems introduces further difficulties by obscuring the consistency (ordering) of memory accesses when errors occur. Identifying the propagation of errors through space and time and quantifying it in terms developers can understand is a major problem for error recovery schemes. This is especially true for scientific applications that rely on complex physical or numerical invariants and for resilience techniques that need to identify consistent states.

The ultimate goal of this research is to provide a visualization of the propagation of errors through application and system software in order to identify for application developers the vulnerability of their data structures and code regions to different types of errors, and the way these errors propagate through application state and logic.

VisStore: Seamless Acquisition, Storage, and Distribution of Massive Imagery

Ease of Use and Deployment for a Fast, Scalable Data Movement Infrastructure

Publications in Visualization:

Page 10 of 23

Start
Prev
5
6
7
8
9
10
11
12
13
14
Next
End

Characterizing Cancer Subtypes using Dual Analysis in Caleydo
C. Turkay, A. Lex, M. Streit, H. Pfister,, H. Hauser. In IEEE Computer Graphics and Applications, Vol. 34, No. 2, pp. 38--47. March, 2014.
ISSN: 0272-1716
DOI: 10.1109/MCG.2014.1

Dual analysis uses statistics to describe both the dimensions and rows of a high-dimensional dataset. Researchers have integrated it into StratomeX, a Caleydo view for cancer subtype analysis. In addition, significant-difference plots show the elements of a candidate subtype that differ significantly from other subtypes, thus letting analysts characterize subtypes. Analysts can also investigate how data samples relate to their assigned subtype and other groups. This approach lets them create well-defined subtypes based on statistical properties. Three case studies demonstrate the approach's utility, showing how it reproduced findings from a published subtype characterization.

Mu-8: Visualizing Differences between Proteins and their Families
J. Mercer, B. Pandian, A. Lex, N. Bonneel,, H. Pfister. In BMC Proceedings, Vol. 8, No. Suppl 2, pp. S5. Aug, 2014.
ISSN: 1753-6561
DOI: 10.1186/1753-6561-8-S2-S5

A complete understanding of the relationship between the amino acid sequence and resulting protein function remains an open problem in the biophysical sciences. Current approaches often rely on diagnosing functionally relevant mutations by determining whether an amino acid frequently occurs at a specific position within the protein family. However, these methods do not account for the biophysical properties and the 3D structure of the protein. We have developed an interactive visualization technique, Mu-8, that provides researchers with a holistic view of the differences of a selected protein with respect to a family of homologous proteins. Mu-8 helps to identify areas of the protein that exhibit: (1) significantly different bio-chemical characteristics, (2) relative conservation in the family, and (3) proximity to other regions that have suspect behavior in the folded protein.

Verifying Volume Rendering Using Discretization Error Analysis
T. Etiene, D. Jonsson, T. Ropinski, C. Scheidegger, J.L.D. Comba, L. G. Nonato, R. M. Kirby, A. Ynnerman,, C. T. Silva. In IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, Vol. 20, No. 1, IEEE, pp. 140-154. January, 2014.

We propose an approach for verification of volume rendering correctness based on an analysis of the volume rendering integral, the basis of most DVR algorithms. With respect to the most common discretization of this continuous model (Riemann summation), we make assumptions about the impact of parameter changes on the rendered results and derive convergence curves describing the expected behavior. Specifically, we progressively refine the number of samples along the ray, the grid size, and the pixel size, and evaluate how the errors observed during refinement compare against the expected approximation errors. We derive the theoretical foundations of our verification approach, explain how to realize it in practice, and discuss its limitations. We also report the errors identified by our approach when applied to two publicly available volume rendering packages.

Curve Boxplot: Generalization of Boxplot for Ensembles of Curves
M. Mirzargar, R. Whitaker, R. M. Kirby. In IEEE Transactions on Visualization and Computer Graphics, Vol. 20, No. 12, IEEE, pp. 2654-63. December, 2014.

In simulation science, computational scientists often study the behavior of their simulations by repeated solutions with variations in parameters and/or boundary values or initial conditions. Through such simulation ensembles, one can try to understand or quantify the variability or uncertainty in a solution as a function of the various inputs or model assumptions. In response to a growing interest in simulation ensembles, the visualization community has developed a suite of methods for allowing users to observe and understand the properties of these ensembles in an efficient and effective manner. An important aspect of visualizing simulations is the analysis of derived features, often represented as points, surfaces, or curves. In this paper, we present a novel, nonparametric method for summarizing ensembles of 2D and 3D curves. We propose an extension of a method from descriptive statistics, data depth, to curves. We also demonstrate a set of rendering and visualization strategies for showing rank statistics of an ensemble of curves, which is a generalization of traditional whisker plots or boxplots to multidimensional curves. Results are presented for applications in neuroimaging, hurricane forecasting and fluid dynamics

RBF Volume Ray Casting on Multicore and Manycore CPUs
A. Knoll, I. Wald, P. Navratil, A. Bowen, K. Reda, M. E. Papka, K. Gaither. In Computer Graphics Forum, Vol. 33, No. 3, Edited by H. Carr and P. Rheingans and H. Schumann, Wiley-Blackwell, pp. 71--80. June, 2014.
DOI: 10.1111/cgf.12363

Modern supercomputers enable increasingly large N-body simulations using unstructured point data. The structures implied by these points can be reconstructed implicitly. Direct volume rendering of radial basis function (RBF) kernels in domain-space offers flexible classification and robust feature reconstruction, but achieving performant RBF volume rendering remains a challenge for existing methods on both CPUs and accelerators. In this paper, we present a fast CPU method for direct volume rendering of particle data with RBF kernels. We propose a novel two-pass algorithm: first sampling the RBF field using coherent bounding hierarchy traversal, then subsequently integrating samples along ray segments. Our approach performs interactively for a range of data sets from molecular dynamics and astrophysics up to 82 million particles. It does not rely on level of detail or subsampling, and offers better reconstruction quality than structured volume rendering of the same data, exhibiting comparable performance and requiring no additional preprocessing or memory footprint other than the BVH. Lastly, our technique enables multi-field, multi-material classification of particle data, providing better insight and analysis.

Approximating Local Homology from Samples
P. Skraba, Bei Wang. In Proceedings 25th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 174-192. 2014.

Recently, multi-scale notions of local homology (a variant of persistent homology) have been used to study the local structure of spaces around a given point from a point cloud sample. Current reconstruction guarantees rely on constructing embedded complexes which become diffcult to construct in higher dimensions. We show that the persistence diagrams used for estimating local homology can be approximated using families of Vietoris-Rips complexes, whose simpler construction are robust in any dimension. To the best of our knowledge, our results, for the first time make applications based on local homology, such as stratification learning, feasible in high dimensions.

Overview and State-of-the-Art of Uncertainty Visualization
G.P. Bonneau, H.C. Hege, C.R. Johnson, M.M. Oliveira, K. Potter, P. Rheingans, T. Schultz. In Scientific Visualization: Uncertainty, Multifield, Biomedical, and Scalable Visualization, Edited by M. Chen and H. Hagen and C.D. Hansen and C.R. Johnson and A. Kauffman, Springer-Verlag, pp. 3--27. 2014.
ISBN: 978-1-4471-6496-8
ISSN: 1612-3786
DOI: 10.1007/978-1-4471-6497-5_1

The goal of visualization is to effectively and accurately communicate data. Visualization research has often overlooked the errors and uncertainty which accompany the scientific process and describe key characteristics used to fully understand the data. The lack of these representations can be attributed, in part, to the inherent difficulty in defining, characterizing, and controlling this uncertainty, and in part, to the difficulty in including additional visual metaphors in a well designed, potent display. However, the exclusion of this information cripples the use of visualization as a decision making tool due to the fact that the display is no longer a true representation of the data. This systematic omission of uncertainty commands fundamental research within the visualization community to address, integrate, and expect uncertainty information. In this chapter, we outline sources and models of uncertainty, give an overview of the state-of-the-art, provide general guidelines, outline small exemplary applications, and finally, discuss open problems in uncertainty visualization.

Data-Parallel Halo Finding with Variable Linking Lengths
W. Widanagamaachchi, P.-T. Bremer, C. Sewell, L.-T. Lo; J. Ahrens, V. Pascucci. In Proceedings of the 2014 IEEE 4th Symposium on Large Data Analysis and Visualization (LDAV), pp. 27--34. November, 2014.

State-of-the-art cosmological simulations regularly contain billions of particles, providing scientists the opportunity to study the evolution of the Universe in great detail. However, the rate at which these simulations generate data severely taxes existing analysis techniques. Therefore, developing new scalable alternatives is essential for continued scientific progress. Here, we present a dataparallel, friends-of-friends halo finding algorithm that provides unprecedented flexibility in the analysis by extracting multiple linking lengths. Even for a single linking length, it is as fast as the existing techniques, and is portable to multi-threaded many-core systems as well as co-processing resources. Our system is implemented using PISTON and is coupled to an interactive analysis environment used to study halos at different linking lengths and track their evolution over time.

Towards Paint and Click: Unified Interactions for Image Boundaries
SCI Technical Report, B. Summa, A.A. Gooch, G. Scorzelli, V. Pascucci. No. UUSCI-2014-004, SCI Institute, University of Utah, December, 2014.

Image boundaries are a fundamental component of many interactive digital photography techniques, enabling applications such as segmentation, panoramas, and seamless image composition. Interactions for image boundaries often rely on two complimentary but separate approaches: editing via painting or clicking constraints. In this work, we provide a novel, unified approach for interactive editing of pairwise image boundaries that combines the ease of painting with the direct control of constraints. Rather than a sequential coupling, this new formulation allows full use of both interactions simultaneously, giving users unprecedented flexibility for fast boundary editing. To enable this new approach, we provide technical advancements. In particular, we detail a reformulation of image boundaries as a problem of finding cycles, expanding and correcting limitations of the previous work. Our new formulation provides boundary solutions for painted regions with performance on par with state-of-the-art specialized, paint-only techniques. In addition, we provide instantaneous exploration of the boundary solution space with user constraints. Furthermore, we show how to increase performance and decrease memory consumption through novel strategies and/or optional approximations. Finally, we provide examples of common graphics applications impacted by our new approach.

In-situ feature extraction of large scale combustion simulations using segmented merge trees
A.G. Landge, V. Pascucci, A. Gyulassy, J.C. Bennett, H. Kolla, J. Chen, P.-T. Bremer. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC 2014), New Orleans, Louisana, IEEE Press, Piscataway, NJ, USA pp. 1020--1031. 2014.
ISBN: 978-1-4799-5500-8
DOI: 10.1109/SC.2014.88

The ever increasing amount of data generated by scientific simulations coupled with system I/O constraints are fueling a need for in-situ analysis techniques. Of particular interest are approaches that produce reduced data representations while maintaining the ability to redefine, extract, and study features in a post-process to obtain scientific insights.

This paper presents two variants of in-situ feature extraction techniques using segmented merge trees, which encode a wide range of threshold based features. The first approach is a fast, low communication cost technique that generates an exact solution but has limited scalability. The second is a scalable, local approximation that nevertheless is guaranteed to correctly extract all features up to a predefined size. We demonstrate both variants using some of the largest combustion simulations available on leadership class supercomputers. Our approach allows state-of-the-art, feature-based analysis to be performed in-situ at significantly higher frequency than currently possible and with negligible impact on the overall simulation runtime.

Efficient I/O and storage of adaptive-resolution data
S. Kumar, J. Edwards, P.-T. Bremer, A. Knoll, C. Christensen, V. Vishwanath, P. Carns, J.A. Schmidt, V. Pascucci. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, IEEE Press, pp. 413--423. 2014.
DOI: 10.1109/SC.2014.39

We present an efficient, flexible, adaptive-resolution I/O framework that is suitable for both uniform and Adaptive Mesh Refinement (AMR) simulations. In an AMR setting, current solutions typically represent each resolution level as an independent grid which often results in inefficient storage and performance. Our technique coalesces domain data into a unified, multiresolution representation with fast, spatially aggregated I/O. Furthermore, our framework easily extends to importance-driven storage of uniform grids, for example, by storing regions of interest at full resolution and nonessential regions at lower resolution for visualization or analysis. Our framework, which is an extension of the PIDX framework, achieves state of the art disk usage and I/O performance regardless of resolution of the data, regions of interest, and the number of processes that generated the data. We demonstrate the scalability and efficiency of our framework using the Uintah and S3D large-scale combustion codes on the Mira and Edison supercomputers.

Robust Detection of Singularities in Vector Fields
H. Bhatia, A. Gyulassy, H. Wang, P.-T. Bremer, V. Pascucci . In Topological Methods in Data Analysis and Visualization III, Mathematics and Visualization, Springer International Publishing, pp. 3--18. March, 2014.
DOI: 10.1007/978-3-319-04099-8_1

Recent advances in computational science enable the creation of massive datasets of ever increasing resolution and complexity. Dealing effectively with such data requires new analysis techniques that are provably robust and that generate reproducible results on any machine. In this context, combinatorial methods become particularly attractive, as they are not sensitive to numerical instabilities or the details of a particular implementation. We introduce a robust method for detecting singularities in vector fields. We establish, in combinatorial terms, necessary and sufficient conditions for the existence of a critical point in a cell of a simplicial mesh for a large class of interpolation functions. These conditions are entirely local and lead to a provably consistent and practical algorithm to identify cells containing singularities.

Scientific Visualization: Uncertainty, Multifield, Biomedical, and Scalable Visualization,
C.D. Hansen, M. Chen, C.R. Johnson, A.E. Kaufman, H. Hagen (Eds.). Mathematics and Visualization, Springer, 2014.
ISBN: 978-1-4471-6496-8

Surface boxplots
M.G. Genton, C.R. Johnson, K. Potter, G. Stenchikov, Y. Sun. In Stat Journal, Vol. 3, No. 1, pp. 1--11. 2014.

In this paper, we introduce a surface boxplot as a tool for visualization and exploratory analysis of samples of images. First, we use the notion of volume depth to order the images viewed as surfaces. In particular, we define the median image. We use an exact and fast algorithm for the ranking of the images. This allows us to detect potential outlying images that often contain interesting features not present in most of the images. Second, we build a graphical tool to visualize the surface boxplot and its various characteristics. A graph and histogram of the volume depth values allow us to identify images of interest. The code is available in the supporting information of this paper. We apply our surface boxplot to a sample of brain images and to a sample of climate model outputs.

Multivariate Volume Visualization through Dynamic Projections
Shusen Liu, Bei Wang, J.J. Thiagarajan, P.-T. Bremer, V. Pascucci. In Proceedings of the IEEE Symposium on Large Data Analysis and Visualization (LDAV), 2014.

We propose a multivariate volume visualization framework that tightly couples dynamic projections with a high-dimensional transfer function design for interactive volume visualization. We assume that the complex, high-dimensional data in the attribute space can be well-represented through a collection of low-dimensional linear subspaces, and embed the data points in a variety of 2D views created as projections onto these subspaces. Through dynamic projections, we present animated transitions between different views to help the user navigate and explore the attribute space for effective transfer function design. Our framework not only provides a more intuitive understanding of the attribute space but also allows the design of the transfer function under multiple dynamic views, which is more flexible than being restricted to a single static view of the data. For large volumetric datasets, we maintain interactivity during the transfer function design via intelligent sampling and scalable clustering. Using examples in combustion and climate simulations, we demonstrate how our framework can be used to visualize interesting structures in the volumetric space.

Visual Exploration of High-Dimensional Data: Subspace Analysis through Dynamic Projections
SCI Technical Report, Shusen Liu, Bei Wang, J.J. Thiagarajan, P.-T. Bremer, V. Pascucci. No. UUSCI-2014-003, SCI Institute, University of Utah, 2014.

Understanding high-dimensional data is rapidly becoming a central challenge in many areas of science and engineering. Most current techniques either rely on manifold learning based techniques which typically create a single embedding of the data or on subspace selection to find subsets of the original attributes that highlight the structure. However, the former creates a single, difficult-to-interpret view and assumes the data to be drawn from a single manifold, while the latter is limited to axis-aligned projections with restrictive viewing angles. Instead, we introduce ideas based on subspace clustering that can faithfully represent more complex data than the axis-aligned projections, yet do not assume the data to lie on a single manifold. In particular, subspace clustering assumes that the data can be represented by a union of low-dimensional subspaces, which can subsequently be used for analysis and visualization. In this paper, we introduce new techniques to reliably estimate both the intrinsic dimension and the linear basis of a mixture of subspaces extracted through subspace clustering. We show that the resulting bases represent the high-dimensional structures more reliably than traditional approaches. Subsequently, we use the bases to define different “viewpoints”, i.e., different projections onto pairs of basis vectors, from which to visualize the data. While more intuitive than non-linear projections, interpreting linear subspaces in terms of the original dimensions can still be challenging. To address this problem, we present new, animated transitions between different views to help the user navigate and explore the high-dimensional space. More specifically, we introduce the view transition graph which contains nodes for each subspace viewpoint and edges for potential transition between views. The transition graph enables users to explore both the structure within a subspace and the relations between different subspaces, for better understanding of the data. Using a number of case studies on well-know reference datasets, we demonstrate that the interactive exploration through such dynamic projections provides additional insights not readily available from existing tools.

Keywords: High-dimensional data, Subspace, Dynamic projection

Design Activity Framework for Visualization Design
S. McKenna, D. Mazur, J. Agutter, M.D. Meyer. In IEEE Transactions on Visualization and Computer Graphics (TVCG), 2014.

An important aspect in visualization design is the connection between what a designer does and the decisions the designer makes. Existing design process models, however, do not explicitly link back to models for visualization design decisions. We bridge this gap by introducing the design activity framework, a process model that explicitly connects to the nested model, a well-known visualization design decision model. The framework includes four overlapping activities that characterize the design process, with each activity explicating outcomes related to the nested model. Additionally, we describe and characterize a list of exemplar methods and how they overlap among these activities. The design activity framework is the result of reflective discussions from a collaboration on a visualization redesign project, the details of which we describe to ground the framework in a real-world design process. Lastly, from this redesign project we provide several research outcomes in the domain of cybersecurity, including an extended data abstraction and rich opportunities for future visualization research.

Keywords: Design, frameworks, process, cybersecurity, nested model, decisions, models, evaluation, visualization

Information Visualization for Science and Policy: Engaging Users and Avoiding Bias
G. McInerny, M. Chen, R. Freeman, D. Gavaghan, M.D. Meyer, F. Rowland, D. Spiegelhalter, M. Steganer, G. Tessarolo, J. Hortal. In Trends in Ecology & Evolution, Vol. 29, No. 3, pp. 148--157. 2014.
DOI: 10.1016/j.tree.2014.01.003

Visualisations and graphics are fundamental to studying complex subject matter. However, beyond acknowledging this value, scientists and science-policy programmes rarely consider how visualisations can enable discovery, create engaging and robust reporting, or support online resources. Producing accessible and unbiased visualisations from complicated, uncertain data requires expertise and knowledge from science, policy, computing, and design. However, visualisation is rarely found in our scientific training, organisations, or collaborations. As new policy programmes develop [e.g., the Intergovernmental Platform on Biodiversity and Ecosystem Services (IPBES)], we need information visualisation to permeate increasingly both the work of scientists and science policy. The alternative is increased potential for missed discoveries, miscommunications, and, at worst, creating a bias towards the research that is easiest to display.

Reflections on How Designers Design With Data
A. Bigelow, S. Drucker, D. Fisher, M.D. Meyer. In Proceedings of the ACM International Conference on Advanced Visual Interfaces (AVI), Note: Awarded Best Paper!, 2014.

In recent years many popular data visualizations have emerged that are created largely by designers whose main area of expertise is not computer science. Designers generate these visualizations using a handful of design tools and environments. To better inform the development of tools intended for designers working with data, we set out to understand designers' challenges and perspectives. We interviewed professional designers, conducted observations of designers working with data in the lab, and observed designers working with data in team settings in the wild. A set of patterns emerged from these observations from which we extract a number of themes that provide a new perspective on design considerations for visualization tool creators, as well as on known engineering problems.

Keywords: Visualization, infographics, design practice

The Nested Blocks and Guidelines Model
M.D. Meyer, M. Sedlmair, P.S. Quinan, T. Munzner. In Journal of Information Visualization, Special Issue on Evaluation (BELIV), 2014.

We propose the nested blocks and guidelines model (NBGM) for the design and validation of visualization systems. The NBGM extends the previously proposed four-level nested model by adding finer grained structure within each level, providing explicit mechanisms to capture and discuss design decision rationale. Blocks are the outcomes of the design process at a specific level, and guidelines discuss relationships between these blocks. Blocks at the algorithm and technique levels describe design choices, as do data blocks at the abstraction level, whereas task abstraction blocks and domain situation blocks are identified as the outcome of the designer's understanding of the requirements. In the NBGM, there are two types of guidelines: within-level guidelines provide comparisons for blocks within the same level, while between-level guidelines provide mappings between adjacent levels of design. We analyze several recent papers using the NBGM to provide concrete examples of how a researcher can use blocks and guidelines to describe and evaluate visualization research. We also discuss the NBGM with respect to other design models to clarify its role in visualization design. Using the NBGM, we pinpoint two implications for visualization evaluation. First, comparison of blocks at the domain level must occur implicitly downstream at the abstraction level; and second, comparison between blocks must take into account both upstream assumptions and downstream requirements. Finally, we use the model to analyze two open problems: the need for mid-level task taxonomies to fill in the task blocks at the abstraction level, as well as the need for more guidelines mapping between the algorithm and technique levels.

Page 10 of 23

Start
Prev
5
6
7
8
9
10
11
12
13
14
Next
End

SCI