Building Dynamic Data Centers for Fast Delivery of New Data and Data Updates
Data(s) |
03/07/2014
03/07/2014
10/06/2014
|
---|---|
Resumo |
Poster at Open Repositories 2014, Helsinki, Finland, June 9-13, 2014 Posters, Demos and Developer "How-To's" In this presentation I will demonstrate and explain the code that I wrote to build a dynamic data center – the CISER Data Archive Census 2010 Summary File 1 (SF1) Download Center – and its importance in the timely delivery of updated datasets at minimal cost. The code is simple, efficient, and easy to use. When updates are released by the U.S. Census Bureau (CB), I only need to update the source files and run the code to make current all files on the Download Center that are for consumer download. The code is designed to eliminate the multi-step process that consumers would have to undertake to get the information they want, apply the updates to or enhance the 49 file segments provided by the CB, create full data sets by merging the segments; make them available in SAS, SPSS, STATA, and CSV format and zip them for download; and automatically update the download center’s website including information about the size of the compressed (zipped) and uncompressed versions of the datasets. The process to build this dynamic data center is scalable and may be used as a model or guide by other repositories who are planning to develop their own. |
Identificador |
http://www.doria.fi/handle/10024/97594 URN:NBN:fi-fe2014070432230 |
Idioma(s) |
en |
Relação |
Poster Reception Open Repositories 2014 Cornell University, United States of America |
Palavras-Chave | #CISER #Data Center #Repository #SF1 #Census |
Tipo |
Poster |