SC16 RENCI Booth Schedule

 

sc16-logo_0

SALT LAKE CITY, Utah – Presentations about key RENCI projects, including iRODS and the iRODS Consortium, ExoGENI and advanced networking, the National Consortium for Data Science (NCDS), and the South Big Data Hub (SBDH) will be featured in the RENCI exhibit at SC16, the world’s premier conference for high performance computing, networking, storage, and analysis.

The RENCI booth (#3628) will open at 7 p.m. Monday, Nov. 14. RENCI experts will be in the booth to talk about and demonstrate advanced networking and cyberinfrastructure, the integrated Rule Oriented Data System (iRODS) and the iRODs Consortium, the Resource Aware Data-centrIc collaboratIon Infrastructure (RADII), the NCDS, and the (SBDH).

In addition to the Monday night opening gala, exhibit hours for the conference will be 10 a.m. – 6 p.m. Tuesday, Nov. 15 and Wednesday, Nov. 16, and 10 a.m. – 3 p.m. Thursday, Nov. 17.

Look for updates about our activities at SC16 via the following social media:

 logo_twitter  facebook_logo  periscope512


7 p.m. – 9 p.m.
Welcome SC16 attendees! The RENCI booth tonight will feature information on:

  • Advanced networking
  • The iRODS data management platform and the iRODS Consortium
  • The National Consortium for Data Science (NCDS)
  • The South Big Data Hub

South Big Data Innovation Hub: Partnerships for Impact
Panorama: Tools for Modeling, Monitoring, Anomaly Detection, and Adaptation for Scientific Workflows on Dynamic Networked Cloud Infrastructures
IRODS in the GABBS Project: Geospatial Data Management for Collaborative Scientific Research
iRODS as a data management and software tool for HPC systems and the Internet of Things
iRODS Overview
iRODS and HGST
Utilizing iRODS to Establish and Maintain an Automated Research Data Collaborative
iRODS Demonstrations & Overview


10:30 a.m. – 11:30 a.m.
South Big Data Innovation Hub: Partnerships for Impact
Presenters: Lea Shanley & Jay Aikat, RENCI

Description: The National Foundation launched four Big Data Regional Innovation Hubs in 2015, new organizations intended to develop the Big Data innovation ecosystem and facilitate thematic communities’ use of data sciences for societal benefit. Specifically, the South Big Data Regional innovation hub accelerates partnerships among people in business, academia, and government who apply data science and analytics to help solve regional and national challenges. Dr. Shanley will introduce the Big Data Hubs and report on some of the significant activities underway in data sharing and infrastructure, Smart Cities, health, and more. Finally, we will discuss opportunities to engage with the Big Data Hubs and our growing networks of Public/Private partnerships.

********

11:30 a.m – 12:30 pm
Panorama: Tools for Modeling, Monitoring, Anomaly Detection, and Adaptation for Scientific Workflows on Dynamic Networked Cloud Infrastructures
Presenters: Anirban Mandal, Paul Ruth, Ilya Baldin (RENCI – UNC Chapel Hill), Rajiv Mayani, Rafael Ferreira da Silva, Ewa Deelman (USC Information Sciences Institute), Jeremy Meredith, Jeffrey Vetter, Vickie Lynch, Ben Mayer (Oak Ridge National Laboratory), Chris Carothers (RPI), Brian Tierney (LBL)

Description: This demonstration will showcase a novel, dynamically adaptable networked cloud infrastructure driven by the demand of a data-driven scientific workflow through use of performance models. It will use resources from ExoGENI – a Networked Infrastructure-as-a-Service (NIaaS) testbed funded through NSF’s Global Environment for Network Innovation (GENI) project. The demo will run on dynamically provisioned ‘slices’ spanning multiple ExoGENI cloud sites that are interconnected using dynamically provisioned connections from Internet2 and ESnet. We will show how a virtual Software Defined Exchange (SDX) platform, instantiated on ExoGENI, provides additional functionality for supporting the modeling and management of scientific workflows. Using the virtual SDX slice, we will demonstrate how tools developed in the DoE Panorama project can enable the Pegasus Workflow Management System to monitor and manipulate network connectivity and performance between sites, pools, and tasks within a workflow. We will use representative, intensive DoE workflows as driving use cases to showcase the above capabilities.

 

********

12:30 p.m. – 1:30 p.m.
IRODS in the GABBS Project: Geospatial Data Management for Collaborative Scientific Research
Presenter: Rajesh Kalyanam, Purdue university

Description: Geospatial data is increasingly finding use in tools from diverse fields such as agronomy, hydrology and sociology to provide better and novel perspectives of scientific data. Funded by the NSF DIBBS program, the GABBS project seeks to create reusable building blocks aiding researchers in adding geospatial data processing, visualization and curation to their tools. Central to these capabilities is the efficient storage and processing of geospatial files. This talk will describe our work integrating iRODS with the HUBzero cyberinfrastructure framework to provide a web-based collaborative research platform with enhanced geospatial capabilities.

********

1:30 p.m. – 2:30 p.m. 

iRODS as a data management and software tool for HPC systems and the Internet of Things
Presenter: David Wade, Director of High Performance Computing, Integral Engineering and Research Associates 

Description: Data ingestion into a cluster of compute nodes, and parsing into databases and buffers suitable for analysis, control, storage and reporting, of SIGNALS collected from the Internet of Things may be accelerated and scaled. Such is shown here with the use of specially designed algorithms which employ off the shelf components normally operating in the domain of high Performance Computing: InfiniBand fabrics, multicore processor accelerators and gpgpus, burst buffers, parallel file systems, in-memory databases, persistent RAM, and subsystems for visualization and reporting. Software such as PI, HDF5, LUSTRE, POSTGRES and iRODS is provided to the data ingest agents to scale and ‘glue’ the system into place. The system provides scalable pieces to the analysis and application layers – for video pattern matching and processing, logging of telecommunication protocols, intervention into control systems, record management, or for ingest into commercial analysis tools where these and other applications may be created and defined: MatLab, Mathematica, Ansys and SAS. The system may run simulations which are matched to the collecting data for verification and situational awareness. The system provides a scalable reporting (and data management) capability largely through employment of the iRODS open source software. The suite of service iRODS may provide to the system as profiled here is explored in this talk.

********

2:30 – 3:30 pm
HGST iRODS
Presenter: Eric Friis, HGST

Description: TBD

********

3:30 p.m. – 4:30 p.m.
Utilizing iRODS to Establish and Maintain an Automated Research Data Collaborative
Presenters: Dave Fellinger, iRODS Consortium

Description: Funding organizations and institutions are establishing stringent guidelines regarding research data retention and indexing. In many cases audit requirements must be met to assure compliance to these rules. IRODS incorporates all of the tools required to provide automated indexing, compliance to retention policies, data storage tiering, distribution, and usage tracking including auditing and reporting.

********

4:30 p.m. – 5:30 p.m.
iRODS Demonstrations & Overview
Presenters: Jason Coposky, iRODS Consortium Executive Director; Terrell Russell, iRODS Senior Data Scientist

Description: Using material from recently presented workshops, we will demonstrate several key features that have made iRODS a critical technology for research organizations worldwide. We will show how storage resource composition makes it easy to distribute and replicate data across multiple file systems; how the iRODS rule engine can automate searchable metadata annotation; and how federation can be allows users to access data at remote sites through a common interface.

PANORAMA: Tools for Modeling, Monitoring, Anomaly Detection and Adaptation for Scientific Workflows on Dynamic Networked Cloud Infrastructures
EMC
iRODS Demonstrations
RADII & Hydroshare: Bridging the gap between data and infrastructure management
South Big Data Innovation Hub: Partnerships for Impact
Utilizing iRODS to establish and maintain an automated research data collaborative
iRODS Software & Consortium Update
iRODS and Reproducible Science in DDN Booth #1931


10:30 a.m. – 11:30 a.m.
PANORAMA: Tools for Modeling, Monitoring, Anomaly Detection and Adaptation for Scientific Workflows on Dynamic Networked Cloud Infrastructures
Presenters: Anirban Mandal, Paul Ruth, Ilya Baldin (RENCI – UNC Chapel Hill), Rajiv Mayani, Rafael Ferreira da Silva, Ewa Deelman (USC Information Sciences Institute), Jeremy Meredith, Jeffrey Vetter, Vickie Lynch, Ben Mayer (Oak Ridge National Laboratory), Chris Carothers (RPI) and Brian Tierney (LBL)

Description: This demonstration will showcase a novel, dynamically adaptable networked cloud infrastructure driven by the demand of a data-driven scientific workflow through use of performance models. it will use resources from ExoGENI – a Networked Infrastructure-as-a-Service (NIaaS) testbed funded through NSF’s Global Environment for Network Innovation (GENI) project. The demo will run on dynamically provisioned connections from Internet2 and ESnet. We will show how a virtual Software Defined Exchange (SDX) platform, instantiated on ExoGENI, provides additional functionality for supporting the modeling and management of scientific workflows. Using the virtual SDX slice, we will demonstrate how tools developed in the DoE Panaroma project can enable the Pegasus Workflow Management System to monitor and manipulate network connectivity and performance between sites, pools, and tasks within a workflow. We will use representative, data-intensive DoE workflows as driving use cases to showcase the above capabilities.

********

11:30 a.m. – 12:30 p.m.
EMC iRODS
Presenter: TBD

Description: TBD

********

12:30 p.m. – 1:30 p.m.
iRODS Demonstrations
Presenter: Jason Coposky, iRODS Consortium Executive Director; Terrell Russell, iRODS Senior Data Scientist

Description: Using material from recently presented workshops, we will demonstrate several key features that have made iRODS a critical technology for research organizations worldwide. We will show how storage resource composition makes it easy to distribute and replicate data across multiple file systems; how the iRODS rule engine can automate searchable metadata annotation; and how federation can allow users to access data at remote sites through a common interface.

********

 1:30 p.m. – 2:30 p.m.
RADII & Hydroshare: Bridging the gap between data and infrastructure management
Presenters: Claris Castillo, Fan Jiang, Ray Idaszak (RENCI – UNC-Chapel Hill)

Description: Data-centric collaborations have become the engines of scientific research. However, these collaborations can be difficult to realize because the appropriate infrastructure, including dedicated network infrastructure needed to transfer large data sets, is often unavailable and few mechanisms exist for controlling data access. Solutions that bridge the gap between infrastructure and data management technologies are needed to make data-centric collaborations feasible. This demonstration presents a novel cloud-based platform, called Resource Aware Data-centrIc collaboratIon Infrastructure (RADII), that addresses these challenges. RADII integrates the Open Resource Control Architecture (ORCA) and integrated Rule Oriented Data System (iRODS) to allow scientists to create and manage collaborations. We will demonstrate novel capabilities that enable dynamic changes to the collaborative infrastructure to support the ingestion of data from external repositories and a unique cross-layer data annotation mechanisms that in conjunction with SDN enables the prioritization of data transfers. This demo represents an early prototype to support the infrastructure needs of an existing cyber-community namely NSF HydroShare. We will show how RADII supports the on-demand provisioning of infrastructure to support the execution of hydrology models and dynamic storage demand.

********

 2:30 p.m. – 3:30 p.m.
South Big Data Innovation Hub: Partnerships for Impact
Presenters: Lea Shanley & Jay Aikat, RENCI

Description: The National Foundation launched four Big Data Regional Innovation Hubs in 2015, new organizations intended to develop the Big Data innovation ecosystem and facilitate thematic communities’ use of data sciences for societal benefit. Specifically, the South Big Data Regional Innovation Hub accelerates partnerships among people in business, academia, and government who apply data science and analytics to help solve regional and national challenges. Dr. Shanley will introduce the Big Data Hubs and report on some of the significant activities underway in data sharing and infrastructure, Smart Cities, health and more. Finally, we will discuss opportunities to engage with the Big Data Hubs and our growing networks of Public/Private partnerships.

********

3:30 p.m. – 4:30 p.m.
Utilizing iRODS to Establish and Maintain an Automated Research Data Collaborative
Presenters: Dave Fellinger, iRODS Consortium

Description: Funding organizations and institutions are establishing stringent guidelines regarding research data retention and indexing. In many cases audit requirements must be met to assure compliance to these rules. IRODS incorporates all of the tools required to provide automated indexing, compliance to retention policies, data storage tiering, distribution, and usage tracking including auditing and reporting.

********

4:30 p.m. – 5:30 p.m.
iRODS Software & Consortium Update
Presenters: Jason Coposky, iRODS Consortium Executive Director; Terrell Russell, iRODS Senior Data Scientist

Description: This talk will discuss the history, the present, and the future of iRODS development – with a focus on the newest 4.2 software features. The iRODS team will also discuss the state of the iRODS Consortium, the organization that supports continued development of iRODS data management software as free open source software.

********

5:00 p.m. – 5:30 p.m.
iRODS and Reproducible Science
LOCATION: DDN Booth #1931
Presenters: Terrell Russell, iRODS Senior Data Scientist
Description: iRODS can help your organization build collaborative, reproducible, provable data workflows with machine-actionable policy, data integrity, and verifiability.
Data management matters. Do it the right way. Come learn how.

********

RADII & Hydroshare: Bridging the gap between data and infrastructure management
iRODS Overview
South Big Data Innovation Hub: Partnerships for Impact
iRODS Demonstrations


10:30 a.m. – 11:30 a.m.
RADII & Hydroshare: Bridging the gap between data and infrastructure management
Presenters: Claris Castillo, Fan Jiang, Ray Idaszak (RENCI – UNC-Chapel Hill)

Description: Data-centric collaborations have become the engines of scientific research. However, these collaborations can be difficult to realize because the appropriate infrastructure, including dedicated network infrastructure needed to transfer large data sets, is often unavailable and few mechanisms exist for controlling data access. Solutions that bridge the gap between infrastructure and data management technologies are needed to make data-centric collaborations feasible. this demonstration presents a novel cloud-based platform, called Resource Aware Data-centrIc collaboratIon Infrastructure (RADII), that addresses these challenges. RADII integrates the Open Resource Control Architecture (ORCA) and integrated Rule Oriented Data System (iRODS) to allow scientists to create and manage collaborations. We will demonstrate novel capabilities that enable dynamic changes to the collaborative infrastructure to support the ingestion of data from external repositories and a unique cross-layer data annotation mechanism that in conjunction with SDN enables the prioritization of data transfers. This demo represents an early prototype to support the infrastructure needs of an existing cyber-community namely NSF HydroShare. We will show how RADII supports the on-demand provisioning of infrastructure to support the execution of hydrology models and dynamic storage demand.

********

11:30 a.m. – 12:30 p.m.
iRODS Overview
Presenters: Jason Coposky, iRODS Consortium Executive Director; Terrell Russell, iRODS Senior Data Scientist 

Description: This talk will present an overview of iRODS (http://irods.org), including an overview of its functions, architecture, use cases, and future directions. iRODS is open source data grid middleware that consolidates the management of heterogeneous data storage technologies. Equipped with configurable automation and metadata cataloging capabilities, over 100 PB of data is managed using iRODS worldwide. Example use cases include tracking gene sequencing workflows at several of the world’s preeminent research institutes and streaming terabytes of production video footage across the globe.

********

12:30 p.m. – 1:30 p.m.
South Big Data Innovation Hub: Partnerships for Impact
Presenters: Lea Shanley & Jay Aikat, RENCI

Description: The National Foundation launched four Big Data Regional Innovation Hubs in 2015, new organizations intended to develop the Big Data innovation ecosystem and facilitate thematic communities’ use of data sciences for societal benefit. Specifically, the South Big Data Regional Innovation Hub accelerates partnerships among people in business, academia, and government who apply data science and analytics to help solve regional and national challenges. Dr. Shanley will introduce the Big Data Hubs and report on some of the significant activities underway in data sharing and infrastructure, Smart Cities, health and more. Finally, we will discuss opportunities to engage with the Big Data Hubs and our growing networks of Public/Private partnerships.

********

1:30 p.m. – 2:30 p.m.
iRODS Demonstrations
Presenters: Jason Coposky, iRODS Consortium Executive Director; terrell Russell, iRODS Senior Data Scientist 

Description: Using material from recently presented workshops, we will demonsrate several key features that have made iRODS a critical technology for research organizations worldwide. we will show how storage resource composition makes it easy to distribute adn replicate data across multiple file systems; how the iRODS rule engine can automate searchable metadata annotation; and how federation can allow users to access data at remote sites through a common interface.

Show floor closes at 3 p.m.