Completed Research

Data Migration Practices and Tiered Storage Management: Challenges and Opportunities (PDF)
Firms continue to face multiple challenges in managing large amounts of data across a tiered storage infrastructure. Data migration between storage tiers is recognized as one of the most complex challenges. We review factors that can cause storage managers to develop ineffective data migration practices. We describe a prototype decision support tool that managers can use to better understand when to migrate data to different tiers. Effective data migration practices are critical to balancing information value, risk, and information management costs.

How Much Information? 2009 Report on American Consumers
UC San Diego Experts Calculate How Much Information Americans Consume
Computer Games and TV Account for Bulk of Information Consumed in 2008
U.S. households consumed approximately 3.6 zettabytes of information in 2008, according to the "How Much Information? 2009 Report on American Consumers," released today by the University of California, San Diego.

Healthcare Report: A Senior IT Executive Perspective on Data Growth in Hospitals
This research report is the culmination of interviews with ten executives in the healthcare industry discussing data storage, data growth, and expectations for data storage growth in the future.

MIT Case Studies (6) Summary
This summary outlines six detailed case studies, which describe the data generation, growth, retention, and sharing trends at MIT for the fields of: Biological Oceanography, Chemistry/Chemical Engineering, Climate Change, Materials Science and Engineering, Neuroimaging, and Physics.

  1. Biological Oceanography
    This paper examines 3 labs that study biological oceanography, using combinations of observational, experimental, and computational methods. Faster and cheaper instruments produce more data, in one case a five-fold increase in five years.
  2. Chemistry and Chemical Engineering
    The amount of data generated by the NMR spectrometers is growing as their usage increases. The case study concludes by explaining the DCIF’s data retention policies and the sharing and reuse of data with other university laboratories.
  3. Climate Change
    This case study provides an early look into the data growth projections for the embryonic Earth System Initiative (ESI) at MIT. Increases in the Laboratory’s computing power and storage capacity have helped drive data to increase by a factor of 100 in five years. At this rate, in the next five years ESI data production could reach 20 petabytes annually.
  4. Materials Science and Engineering
    This case study gives examples of how data is created and stored by material scientists and engineers at MIT. The amount of data depends on specific research goals and the tools, experimental techniques, and computational methods employed by the individual researcher. Both simulation and experiments are used, with the simulations producing more data in the cases reported here.
  5. Neuroimaging
    Each MRI session produces a total of 3.6 gigabytes of human image data per subject. The Center sees 1500 subjects a year, generating approximately 5.4 terabytes of data.
    The Center’s current rate of data generation will increase as scanner hardware and software improves. It is anticipated that improvements over the next five years will increase the size of subject data sessions by a factor of 10 (3.6 gigabytes of data currently, 36 gigabytes of data per session in five years).
  6. The Department of Physics at MIT
    MIT’s Physics department has about 90 experimental physics faculty, who generate massive amounts of data. The nature and size of each project varies, but they tend to run continuously over months or years. A rough extrapolation is that the Physics department as a whole stores about 2 * 1018 bytes a year (2 exabytes) of new data.

    All MIT Case Studies Combined (PDF)

    July 2009 HMI? Case Studies on Scientific Research at MIT Webinar (pdf)
    The July program webinar covered the above mentioned MIT Case Studies on Scientific Research at MIT. Authors: S. Madnick, M. Smith. The webinar was recorded, please click here to view.

November 2008 HMI? Program Webinar (PDF)
The November program webinar covered project discussions. Projects on processor capability and green storage are covered. Authors: P. Tallon and G. Hidley

October 2008 HMI? Program Webinar (PDF)
The October program webinar covered project discussions. Projects on scaling effects in IT and Global Mobile Communications are covered. Authors: C. Magee and E. Fife

September 2008 HMI? Program Webinar (PDF)
The September program webinar covered project discussions and a general information survey. The Executive Interviewing Project and background information on data centers are covered. Authors: R. Bohn and J. Short

July 2008 HMI? Program Webinar (PDF)
The July program webinar covered initial research questions, background on storage research, and current examples of information in use. Authors: R. Bohn and J. Short

May 2008 HMI? Program Launch (PDF)
The HMI? Research program was formally launched in May 2008. This slidedeck presents the program and initial ideas. Authors: R. Bohn and J. Short

GIIC Report: Digital Home Entertainment Project Handbook (November 2008) (PDF)
The Digital Home project was a year-long research study carried out by USC and UCSD. The study sought to provide insights for a better understanding of digital home evolution, drivers, and user experience. The research was completed by Professor Omar El Sawy (USC) and Dr. James Short (UCSD).

GIIC Report: Information Lifecycle Management (August 2007) (PDF)
Effective information lifecycle management can minimize business risk, reduce costs, and incr ease control over data. SIM's Advanced Practices Council (APC) members sponsored a year-long study to learn how to reap these benefits. This report summarizes findings from that study, citing twelve lessons learned from early information lifecycle initiatives. Author(s): James E. Short

Original HMI? Reports

How Much Information? 2009 follows two University of California, Berkeley research studies, HMI? 2000 and HMI? 2003, conducted by Professors Hal Varian and the late Peter Lyman. HMI? 2009 builds on the two Berkeley studies, but uses different definitions and measurement methods. Below are links to the original HMI? 2000 and HMI? 2003 reports.

2000, UC Berkeley How Much Information?
http://www2.sims.berkeley.edu/research/projects/how-much-info/

2003, UC Berkeley How Much Information?
http://www2.sims.berkeley.edu/research/projects/how-much-info-2003/