Publications
- "Wrap Scientific Applications as WSRF Grid Services using gRAVI", Chard, K.; Boverhof, J.; Tan, W.; Madduri, R.; Foster, I.; to appear in Proceedings of the International Conference on Web Services, Los Angeles, CA, July 2009. PDF
- "Scientific workflows as services in caGrid: a Taverna and gRAVI approach", IEEE International Conference on Web Services (ICWS 2009), Wei Tan, Kyle Chard, Dinanath Sulakhe, Ravi Madduri, Ian Foster, Stian Soiland-Reyes, Carole Goble. July 2009. (Research track)
- "The Globus Replica Location Service: Design and Experience", Chervenak, A.; Schuler, R.; Ripeanu, M.; Amer, M.; Bharathi, S.; Foster, I.; Iamnitchi, A.; Kesselman, C.; to appear in IEEE Transactions on Parallel and Distributed Systems, 2009. PDF
- "Data Staging Strategies and Their Impact on the Execution of Scientific Workflows", Shishir Bharathi, Ann Chervenak, to appear in Proceedings of the International Workshop on Data-Aware Distributed Computing (DADC'09) in conjunction with the 18th International Symposium on High Performance Distributed Computing (HPDC-18), Munich, Germany, June 2009. PDF
- GridFTP Multilinking, John Bresnahan, Michael Link, Raj Kettimuthu and Ian Foster, to appear in Proceedings of the 2009 TeraGrid Conference, Arlington, VA, June 2009. Paper
- UDT as an Alternative Transport Protocol for GridFTP, John Bresnahan, Michael Link, Raj Kettimuthu and Ian Foster, to appear in Proceedings of the 7th International Workshop on Protocols for Future, Large-Scale and Diverse Network Transports (PFLDNeT 2009), Tokyo, Japan, May 2009. Paper
- "Scaling up Workflow-based Applications", Scott Callaghan, Ewa Deelman, Dan Gunter, Gideon Juve, Philip Maechling, Christopher Brooks, Karan Vahi, Kevin Milner, Robert Graves, Edward Field, David Okaya, Thomas Jordan. Journal of Computer and System Sciences, Special Issue on Workflows, in submission.
- Reducing Time-to-Solution Using Distributed High-Throughput Mega-Workflows – Experiences from SCEC CyberShake, Scott Callaghan, Phil Maechling, Ewa Deelman, Karan Vahi, Gaurang Mehta, Gideon Juve, Kevin Milner, Robert Graves, Edward Field, David Okaya, Dan Gunter, Keith Beattie, Thomas Jordan. Fourth IEEE International Conference on eScience (eScience 2008), Indianapolis, IN, USA, December 2008. Draft: PDF
- "Characterization of Scientific Workflows", Shishir Bharathi, Ann Chervenak, Ewa Deelman, Gaurang Mehta, Mei-Hui Su, Karan Vahi, The 3rd Workshop on Workflows in Support of Large-Scale Science (WORKS08), in conjunction with Supercomputing (SC08) Conference, Austin, Texas, November, 2008. PDF
- "Policy-Driven Data Management for Distributed Scientific Collaborations Using a Rule Engine", (poster paper) Sara Alspaugh, Ann Chervenak, Ewa Deelman, Supercomputing (SC08) Conference, Austin, Texas, November 2008. Received Best Undergraduate Student Poster award in ACM Student Poster competition. PDF
- "Using Overlays For Efficient Data Transfer Over Shared Wide-Area Networks", Gaurav Khanna, Umit Catalyurek, Tahsin Kurc, Raj Kettimuthu, P. Sadayappan, Ian Foster and Joel Saltz, Proceedings of the 2008 ACM/IEEE conference on Supercomputing (SC'08), November 2008. Paper
- A GridFTP Transport Driver for Globus XIO, Raj Kettimuthu, Liu Wantao, Joseph Link and John Bresnahan, Proceedings of the 2008 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA 2008), July 2008. Paper
- Integrating caGrid and TeraGrid, Christine Hung, Ravi Madduri, Kiran Keshav, Scott Oster, Stephen Langella, Stuart Martin, Stephen Mock. TeraGrid 2008, Las Vegas, NV, 2008
- Multi-Hop Path Splitting and Multi-Pathing Optimizations for Data Transfers over Shared Wide-Area Networks using GridFTP, Gaurav Khanna, Umit Catalyurek, Tahsin Kurc, Raj Kettimuthu, P. Sadayappan, Joel Saltz and Ian Foster, Proceedings of the 17th IEEE International Symposium on High-Performance Distributed Computing (HPDC), June 2008. Paper
- "Data Management Challenges of Data-Intensive Scientific Workflows", Ewa Deelman, Ann Chervenak, 3rd International Workshop on Workflow Systems in e-Science (WSES 08), in conjunction with CCGrid 2008 Conference, Lyon, France, May 20, 2008. PDF
- A Dynamic Scheduling Approach for Coordinated Wide-Area Data Transfers using GridFTP, Gaurav Khanna, Umit Catalyurek, Tahsin Kurc, Raj Kettimuthu, P. Sadayappan and Joel Saltz, Proceedings of the 22nd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2008), April 2008. Paper
- A Data Placement Service for Petascale Applications, Ann L. Chervenak, Robert Schuler, USC Information Sciences Institute, in the Proceedings of PDSI Workshop, Paper
- Globus GridFTP: What's New in 2007, John Bresnahan, Michael Link, Gaurav Khanna, Zulfikar Imani, Raj Kettimuthu and Ian Foster, Proceedings of the First International Conference on Networks for Grid Applications (GridNets 2007), October 2007. Paper
- Log Summarization and Anomaly Detection for Troubleshooting Distributed Systems, J. Bresnahan, A. Brown, D. Gunter, J. M. Schopf, M. Swany, B. L. Tierney, Proceedings of the 8th IEEE/ACM International Conference on Grid Computing (Grid 2007), Austin, TX, 2007. Paper Talk Slides
- End-to-End Data Solutions for Distributed Petascale Science, Jennifer M. Schopf, Ann Chervenak, Ian Foster, Dan Fraser, Dan Gunter, Nick LeRoy, Brian Tierney, Invited paper, CTWatch Quarterly, October 2007. Paper
- Data Placement for Scientific Applications in Distributed Environments, Ann Chervenak, Ewa Deelman, Miron Livny, Mei-Hui Su, Rob Schuler, Shishir Bharathi, Gaurang Mehta, Karan Vahi, Proceedings of 8th IEEE/ACM International Conference on Grid Computing (Grid 2007), Austin, TX, 2007. Paper Presentation at Grid Conference (PDF)
- LEGS: A WSRF Service to Estimate Latency Between Arbitrary Hosts on the Internet, R. Vijayprasanth, R. Kavithaa and Raj Kettimuthu, Proceedings of the 2007 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA 2007), June 2007. Paper
- GridFTP Pipelining, John Bresnahan, Michael Link, Raj Kettimuthu, Dan Fraser and Ian Foster, Proceedings of the 2007 TeraGrid Conference, June 2007. Paper
- Enabling Distributed Petascale Science, Andrew Baranovski, Shishir Bharathi, John Bresnahan, Ann Chervenak, Ian Foster, Dan Fraser, Tim Freeman, Dan Gunter, Keith Jackson, Kate Keahey, Carl Kesselman, David E. Konerding, Nick Leroy, Mike Link, Miron Livny, Neill Miller, Robert Miller, Gene Oleynik, Laura Pearlman, Jennifer M. Schopf, Robert Schuler, Brian Tierney, Proceedings of SciDAC 2007, Boston, MA, 2007. Paper
- The CEDPS Troubleshooting Architecture and Deployment on the Open Science Grid, Brian L. Tierney, Dan Gunter, Jennifer M. Schopf, Proceedings of SciDAC 2007, Boston, MA, 2007. Paper
- Virtual Workspaces for Scientific Applications, Kate Keahey, Tim Freeman, Jerome Laurent, Doug Olson, Proceedings of SciDAC 2007, Boston, MA, 2007. Paper
- Harnessing Multicore Processors for High Speed Secure Transfer, John Bresnahan, Raj Kettimuthu, Michael Link and Ian Foster, Proceedings of the 26th IEEE Infocom's High-Speed Networks Workshop, May 2007. Paper
- GridCopy: Moving Data Fast on the Grid, Raj Kettimuthu, William Allcock, Lee Liming, John-Paul Navarro and Ian Foster, Proceedings of the Fourth High Performance Grid Computing Workshop to be held in conjunction with International Parallel and Distributed Processing Symposium (IPDPS 2007), March 2007. Paper
Design documents and reports
- Data Placement Service Design draft (PDF) by Robert Schuler, ISI
- Economics Of Storage Management draft (PDF) by Andrew Baranovski, FNAL, April 2008
- Best Practices Document, Brian Tierney and Dan Gunter, CEDPS Design Document, July 2007
- Virtual Workspace Roadmap, Sept 2007
- Data Placement Service Design Draft, R. Schuler and A. Chervenak, July 2007
- MOPS Architecture Overview, J. Bresnahan, R. Kettimuthu, D. Fraser, N. Leroy, July 2007
- Midterm report, March 2007
- Dynamic Resource Provisioning, I. Raicu and I. Foster, CEDPS Design Document, October 2006
- Management Plan, November 2006
- Original Proposal, March 2006
Presentations
- Dan Gunter, CEDPS Troubleshooting, OSG All-hands meeting, March 2009.
- Miron Livny, The Data Placement Problem - a view from the Open Science Grid (OSG), Japan-NSF networking workshop, October 2008.
- Raj Kettimuthu, Reliable Data Movement using Globus GridFTP and RFT: New Developments in 2008, Exhibitor's Forum, The SC08 Conference, Austin, TX, November 2008.
- Dan Gunter, Scott Callaghan, Gaurang Mehta, Gideon Juve, Keith Beattie, Ewa Deelman, Phil Maechling, Brian Tierney, Karan Vahi, "When Workflow Management Systems and Logging Systems Meet: Analyzing Large-Scale Execution Traces" (PDF), Poster Session at SC08.
- Raj Kettimuthu, Data Movement Tools for Distributed Petascale Science, Maseeh College of Engineering and Computer Science, Portland State University, Portland, OR, September 2008
- Ravi Madduri, ExpeditionWorkshop08192008.ppt, Collaborative Expedition Workshop, "The Role of Cyberinfrastructure in Scientific Knowledge: Emergence, Validation, and Peer Review", August 19, 2008
- Raj Kettimuthu, Reliable Data Movement Framework for Distributed Science Environments, The 2008 World Congress in Computer Science, Computer Engineering, and Applied Computing, Las Vegas, NV, July 2008
- Dan Gunter, NetLogger Framework, STAR Collaboration meeting at UC Davis, CA, 20 June 2008. http://nuclear.ucdavis.edu/~bhaag/STAR_CollaborationMeeting/
- Andrew Baranovski, Troubleshooting dCache/SRM, OSG AHM meeting, Livingston, NJ, May 2008
- Miron Livny, Distributed (Data)Infrastructure/Collaboratories, DOE Office of Science ASCR PI Meeting, April 2008.
- Ian Foster, Services for Science, Keynote talk at INGRID Conference, Ischia, Italy, April 2008. Similar talks presented at: University of Florida and elsewhere.
- Ravi Madduri, State of Service Oriented Science Tooling, Open Source Grid and Cluster Conference, May 13-15 2008
- Ann Chervenak and Raj Kettimuthu, Globus Data Services for Science, Open Source Grid and Cluster Conference, Oakland, CA, May 2008
- Raj Kettimuthu, Globus GridFTP and RFT: An Overview and New Features, National Energy Research Scientific Computing Center, Oakland, CA, May 2008
- John Bresnahan, GridFTP and Cluster Meltdown: When No Means 'Maybe Later', Open Source Grid and Cluster Conference, Oakland, CA, May 2008
- Brian Tierney, Grid Troubleshooting, Open Source Grid and Cluster Conference, May 13-15 2008
- Dan Gunter, NetLogger Toolkit for Grid Troubleshooting, Open Source Grid and Cluster Conference, May 13-15 2008
- John Bresnahan, GridFTP: Challenges In Data Transport, The 22nd Open Grid Forum (OGF22), Cambridge, MA, February 2008.
- Miron Livny, Making CI Work for You, NSF Workshop on Cyberinfrastructure for Genomics, January 2008.
- Raj Kettimuthu, The Globus GridFTP Framework and Server, Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan, December 2007
- Brian Tierney and Ann Chervenak, Overview of CEDPS Troubleshooting and Data Areas, SDM Center All Hands Meeting, Nov 28-30 2007
- Jennifer Schopf, CEDPS Overview, Booth Talk, SuperComputing 2007, Reno, NV, November 13, 2007
- Dan Gunter, Grid troubleshooting, Booth Talk, SuperComputing 2007, Reno, NV, November 13, 2007
- Raj Kettimuthu, A Sneak Peak of What's New in Globus GridFTP, Exhibitor's Forum, The SC07 Conference, Reno, Nevada, November 2007
- Kate Keahey, Globus Virtual Workspaces, Booth Talk, SuperComputing 2007, Reno, NV, November 13, 2007
- Ravi Madduri, Remote Application Virtualization Interfaces, Booth Talk, SuperComputing 2007, Reno, NV, November 14, 2007
- Raj Kettimuthu, Tutorial: Configuring and Deploying GridFTP for Managing Data Movement in Grid/HPC Environments, SC Technical Program, SuperComputing 2007, Reno, NV, November 12, slides
- Jennifer Schopf, CEDPS: Center for Enabling Distributed Petascale Science, SciDAC Centers and Institutes Status Workshop, Las Vegas, NV, October 15, 2007.
- Raj Kettimuthu, Globus GridFTP: What's New in 2007, First International Conference on Networks for Grid Applications (GridNets 2007), Lyon, France, October 2007
- Brian Tierney, Log Summarization and Anomaly Detection for Troubleshooting Distributed Systems, Grid2007 Conference, Sept 21, 2007
- Brian Tierney, Centralized Logging for the Open Science Grid, HEPiX Fall 2007
- Ian Foster, Enabling Distributed Petascale Science, Scientific Discovery through Advanced Computing Conference, Boston, Mass., May 2007.
- Ian Foster, Scaling eScience Impact, 1st Iberian Cyberinfrastructure Conference, Santiago de Compostela, Galicia, Spain (May 15, 2007). [Earlier versions presented at the German eScience Conference, Baden Baden, Germany (May 2, 2007), and elsewhere.]
- Brian Tierney, "Centralized Logging for Grid Troubleshooting", NERSC Invited Seminar, Sept 6, 2007, https://www.nersc.gov/news/presentations/osf_lunch/cal/show.php?CLm=09&CLd=6&CLy=2007&c_num=OSF_Lunch
- Ian Foster, "CEDPS Introduction", R. Lindsay visit to ANL, July 30, 2007, http://www.cedps.net/images/0/0c/Cedps_overview.pdf
- Dan Fraser, "CEDPS Data", R. Lindsay visit to ANL, July 30, 2007, http://www.cedps.net/images/5/5d/CEDPS_Data_Overview.pdf
- Kate Keahey, "CEDPS scalable services", R. Lindsay visit to ANL, July 30, 2007, http://www.cedps.net//images/6/6a/CEDPS_Services_Overview.pdf
- Jennifer Schopf, "CEDPS Troubleshooting", R. Lindsay visit to ANL, July 30, 2007, http://www.cedps.net/images/4/45/CEDPS_TS_Overview.pdf
- Ian Foster, "Future Opportunities", R. Lindsay visit to ANL, July 30, 2007, http://www.cedps.net/images/d/d3/Future_opportunities.pdf
- Ian Foster, "Enabling Distributed Petascale Science", Annual SciDAC Meeting 2007, Boston, MA, June 28, 2007, http://www.cedps.net/images/b/b9/070628_SciDAC_Foster.pdf
- Dan Gunter, "Troubleshooting Data Movement", CLADE: Challenges of Large Applications on Distributed Environments 2007, Monterey, CA, June 25, 2007, http://www.cedps.net/images/e/e3/Clade2007_dkg.ppt
- Dan Fraser, "A Managed Object Placement Service (MOPS) Using NEST and GridFTP", Condor Week, Madison, Wisconsin, May 1, 2007. http://www.cs.wisc.edu/condor/CondorWeek2007/tuesday_condor.html
- Jennifer M. Schopf, “Logging Best Practices,” Open Grid Forum 20, Manchester, UK, March 9, 2007, http://cedps.net/images/4/45/CEDPS-LoggingBestPracticesOGF20.pdf
- Ian Foster, “Service-Oriented Science: Scaling eScience Impact,” Distinguished Lecture, Louisiana State University, November 27, 2006. Also presented as Keynote, Web Intelligence Conference, Hong Kong, December 18, 2006; and at the University of Cape Town, March 29, 2007.
- Ian Foster, “Grid,” IFIP Summer School on Software Engineering and Computer Science, Gordon’s Bay, South Africa, March 26, 2007.
- Ian Foster, “System-Level Science: Scientific Exploration & IT Implications,” Keynote, Workshop on Wireless Networking, Automated Information Processing, and Web & Grid Services, Puerto Rico, February 4, 2007.
- Brian Tierney, “Logging Recommendations for Effective Troubleshooting,” OSG Consortium All Hands Meeting, March 6, 2007. https://indico.fnal.gov/contributionDisplay.py?contribId=120&sessionId=13&confId=468
- Ann Chervenak, “Data Services in the SciDAC CEDPS Project,” OSG Consortium All Hands Meeting, March 6, 2007. https://indico.fnal.gov/contributionDisplay.py?contribId=90&sessionId=53&confId=468
- Jennifer M. Schopf, “SciDAC Center for Enabling Distributed Petascale Science,” Oak Ridge National Laboratory, Oak Ridge, TN, February 8, 2007. http://www-unix.mcs.anl.gov/~schopf/Talks/cedps-ornl-feb07.ppt
- Jennifer M. Schopf, “CEDPS and CDIGS: Two Globus Projects,” Middleware And Grid Infrastructure Coordination (MAGIC), National Science Foundation, Arlington, VA, February 7, 2007. http://www-unix.mcs.anl.gov/~schopf/Talks/cedps-nsf-feb07.ppt
- Dan Fraser and Ann Chervenak, “Data Services: Future Directions,” Earth System Grid meeting in Boulder, CO, January 17-19, 2007.
- Ann Chervenak and Dan Fraser, “Data Services: Future Directions,” TeraGrid Data Meeting in San Diego, January 9-11, 2007.
- Kate Keahey, “On-Demand Virtual Workspaces: Quality of Life in the Grid,” 5th Meeting of Spanish Initiative in Grid Middleware, November 2006, http://workspace.globus.org/papers/on_demand_workspaces_granada.ppt
- Ann Chervenak, “Next Generation Data Services,” Fermi National Laboratory, October 27, 2006
- Tim Freeman and Kate Keahey, “Virtual Workspace Appliances,” SC06 Booth Presentation, November 2006, http://workspace.globus.org/papers/workspace_appliances_sc06_booth.pdf
- Two page overview for SC 2006, November 2006
- Six slide overview, October 2006
Posters
- "Communicating Security Assertions over the GridFTP Control Channel", Raj Kettimuthu, Liu Wantao, Frank Siebenlist and Ian Foster, 4th IEEE International Conference on e-Science, Dec 2008. PPT
- "When Workflow Management Systems and Logging Systems Meet: Analyzing Large-Scale Execution Traces". Dan Gunter, Scott Callaghan, Gaurang Mehta, Gideon Juve, Keith Beattie, Ewa Deelman, Phil Maechling, Brian Tierney, Karan Vahi. SC08, Austin, TX. Abstract: Word, PDF.
- Bridging the divide between DOE facility users and their remote data [CEDPS overview poster], SciDAC Annual Meeting 2008, Seattle, Washington, July 2008. (PDF | PPT)
- Troubleshooting on the Open Science Grid, SciDAC Annual Meeting 2007, Boston, MA, June 2007 (PDF | PPT)
- Virtual Workspaces for Scientific Applications, SciDAC Annual Meeting 2007, Boston, MA, June 2007. (PDF)
Tutorials
SC 2008 GridFTP Tutorial, November 17, 2008, Austin, TX
- Configuring and Deploying GridFTP for Managing Data Movement in Grid/HPC Environments
- Raj Kettimuthu, John Bresnahan, Mike Link
Midwest Grid School 2008 Data Management Tutorial, Sep 2008, Chicago, IL
- Distributed Data Management in Grid Environments
- Raj Kettimuthu
Open Source Grid and Cluster Conference 2008 GridFTP Tutorial, May 2008, Oakland, CA
- Managing Data Movement with Globus GridFTP
- Raj Kettimuthu and John Bresnahan
SC 2007 GridFTP Tutorial, November 12, 2007, Reno, NV
- Configuring and Deploying GridFTP for Managing Data Movement in Grid/HPC Environments
- Raj Kettimuthu, John Bresnahan, Mike Link
SC 2007 Globus Tutorial, November 11, 2007, Reno, NV
- Tutorial: Globus Overview and Hands on
- Jen Schopf, Ravi Madduri, John Bresnahan (ANL), Laura Pearlman (ISI)
SciDAC 2007 Tutorial, June 29, 2007, Boston, MA
- Introduction to Grids and CEDPS (powerpoint)
- Troubleshooting (powerpoint)
- Data (powerpoint)
The 8th LCI (Linux Clusters Institute) Conference on High-Performance Clustered Computing, South Lake Tahoe, CA, May 2007.
- Optimizing Data Transport: A Tutorial on Deploying GridFTP – From Simple to Advanced Feature Configurations
- John Bresnahan and Mike Link
Awards
- The tools developed under the Services Area (gRAVI) received the "Outstanding Technology Achievement" award at the annual caBIG meeting in Washington, D.C., for integrating caBIG and TeraGrid.
