Center for Data Analytics Workshop - April 15

The new Center for Data Analytics (CDA) at Georgia Tech will offer its first workshop on Big Data Research and Development to exchange information on expertise and activities related to data analytics at Georgia Tech, GTRI, and Emory University. The workshop will take place on Monday, April 15, from 8:30 a.m. to 2:50 p.m. The CDA workshop is designed to strengthen and build the Georgia Tech community of researchers and partners in areas of big data. It is sponsored in part by the Institute for Data and High Performance Computing.

Location: Marcus Nanotechnology Research Building (Rooms 1116-1118), 345 Ferst Drive, Atlanta, GA 30332


Program Format

The workshop will consist of several invited talks, a panel, and a poster session. 


Workshop Schedule

8:30 – 8:50   Breakfast

8:50 – 9:00   Welcoming Remarks - Richard Fujimoto (Director of the Institute for Data and High Performance Computing), Haesun Park (Director of the Center for Data Analytics), Zvi Galil (Dean of the College of Computing)

9:00 – 10:30   Invited Talks, Session Chair: Nina Balcan

9:00 – 9:15   Computational Challenges in the Development of Improved Cancer Diagnostics and Therapies - John McDonald (Georgia Tech, Biology)

9:15 – 9:30   Applied Data Analytics at GTRI - Dan Campbell (Georgia Tech Research Institute)

9:30 – 9:45   Data and Computational Challenges in Integrative Biomedical Informatics - Joel Saltz (Director of the Center for Comprehensive Informatics; Emory University, Biomedical Informatics)

9:45 – 10:00   Sparsity and Rank Constraints in Solving Underdetermined Systems - Justin Romberg (Georgia Tech, Electrical and Computer Engineering)

10:00 – 10:15   Data Analytics for Materials Datasets - Surya Kalidindi (Georgia Tech, Mechanical Engineering)

10:15 – 10:30   Leveraging Geospatial Technologies in Planning for a Sustainable Future - Subhro Guhathakurta (Director of the Center for Geographic Information Systems; Georgia Tech, Architecture)

10:30 – 10:45   Break

10:45 – 12:10   Invited Talks, Session Chair: Hongyuan Zha

10:45 – 11:00   Operational Aspects of Big Data for a Research University - Ron Hutchins (Georgia Tech, Office of Information Technology)

11:00 – 11:15   The Role of Predictive Models in Data Analytics - Xiaoming Huo (Georgia Tech, Industrial and Systems Engineering)

11:15 – 11:30   The Challenge of Large Data Sets – Examples from Petascale Simulations of Fluid Turbulence - P.K. Yeung (Georgia Tech, Aerospace Engineering)

11:30 – 11:45   Big Data Analytics: Long Tail, Web Analytics, and Social Media - Jeffrey Hu (Georgia Tech, Business)

11:45 – 12:00   Learning and Optimizing Information Diffusion - Le Song (Georgia Tech, Computational Science and Engineering)

12:00 – 12:10   Remarks about Data Analytics at Georgia Tech - Steve Cross (Georgia Tech, Executive VP of Research)

12:10 – 1:45   Lunch and Poster Session

1:45 – 2:45   Panel Discussion - Panelists: Jianjun Shi (Industrial and Systems Engineering), John Wise (Physics), Suresh Menon (Aerospace Engineering), Mark Braunstein (Associate Director of the Health Systems Institute; Interactive Computing), John Stasko (Interactive Computing)

Chair: Polo Chau (Georgia Tech, Computational Science and Engineering)

2:45 – 2:50   Concluding Remarks - Haesun Park and Richard Fujimoto


List of Posters

A Cubic Algorithm for Computing Gaussian Volume
Authors: Ben Cousins and Santosh Vempala

Accelerating Analysis and Data Reduction in Experimental Combustion Research through High Performance Computing
Authors: Benjamin Wilde, Christopher Foley, Andrew Marshall, Benjamin Emerson, J. Matt Quinlan, Tim Lieuwen, Drew Bratcher and Matthew Wolf

Apolo: Interactive Graph Exploration Combining Machine Learning and Visualization
Authors: Polo Chau, Niki Kittur, Jason Hong and Christos Faloutsos

Beyond Sentiment: The Manifold of Human Emotions
Authors: Seungyeon Kim, Fuxin Li, Guy Lebanon and Irfan Essa

Bounded Low Rank Matrix Approximation
Authors: Ramakrishnan Kannan, Mariya Ishteva and Haesun Park

Distributed Clustering on Graphs
Authors: Maria-Florina Balcan, Steven Ehrlich and Yingyu Liang

Distributed Non-negative Matrix Factorization
Authors: Long Tran, Hua Ouyang, Alexander Gray and Haesun Park

Efficient Active and Semi-Supervised Learning of Disjunctions
Authors: Maria-Florina Balcan, Chris Berlind, Steven Ehrlich and Yingyu Liang

Fast Rank-2 Non-negative Matrix Factorization for Hierarchical Clustering
Authors: Da Kuang and Haesun Park

Infrastructure for Managing and Analyzing Biological Networks Derived from Collections of Plant Images
Authors: Abhiram Das, Alexander Bucksch and Joshua S. Weitz

Intelligent Systems and Big Data at GTRI
Authors: Elizabeth Whitaker and Ethan Trewhitt

Local Low-Rank Matrix Approximation
Authors: Joonseok Lee, Seungyeon Kim, Guy Lebanon and Yoram Singer

Low Rank Estimation of Smooth Kernels on Graphs
Authors: Vladimir Koltchinskii and Pedro Rangel

OPAvion: Mining & Visualization in Large Graphs
Authors: Polo Chau, Leman Akoglu, U. Kang, Danai Koutra and Christos Faloutsos

Potential Induced Random Teleportation on Finite Graphs
Authors: Shui-Nee Chow, Xiaojing Ye and Haomin Zhou

Red Fox: An Execution Environment for Data Warehousing Applications on GPUs
Authors: Haicheng Wu, Gregory Diamos, Tim Sheard, Molham Aref and Sudhakar Yalamanchili

Stochastic ADMM for Nonsmooth Optimization
Authors: Hua Ouyang, Niao He, Long Tran and Alexander Gray

The Sample Complexity of Independent Component Analysis
Authors: Santosh Vempala and Ying Xiao

Uncover Topic-Sensitive Information Diffusion Networks
Authors: Nan Du, Le Song, Hyenkyun Woo and Hongyuan Zha

VisIRR: Interactive Visual Information Retrieval and Recommendation for Large-scale Document Data
Authors: Jaegul Choo, Changhyun Lee, Edward Clarkson, Zhicheng Liu, Hanseung Lee, Duen Horng (Polo) Chau, Fuxin Li, Ramakrishnan Kannan, Charles D. Stolper, David Inouye, Nishant Mehta, Hua Ouyang, Subhojit Som, Alexander Gray, John Stasko and Haesun Park


Event Dates: 
Monday, April 15, 2013 - 8:30am to 2:50pm