CHUNKY: Stata module to chunk a large text file into smaller parts
David Elliott ()
Additional contact information
David Elliott: Nova Scotia Department of Health
Statistical Software Components from Boston College Department of Economics
Abstract:
chunky breaks a large text file into chunks of a size specified by the user. It is typically used to break a huge data dump that is too large for infiling into smaller manageable chunks. chunky will allow creation of serially named chunks for subsequent infiling or insheeting. The smaller data subsets can then be appended together to create a dataset with all required observations. This version of chunky has been completely rewritten to use the Mata capabilities of Stata release 9 and higher and the syntax has completely changed. The previous version has been deprecated as chunky8. Some users may still require a line-indexed method of chunking files so chunky8 will continue to be supported.
Language: Stata
Requires: Stata version 9
Keywords: data management; ASCII files (search for similar items in EconPapers)
Date: 2009-01-23, Revised 2010-09-01
Note: This module should be installed from within Stata by typing "ssc install chunky". The module is made available under terms of the GPL v3 (https://www.gnu.org/licenses/gpl-3.0.txt). Windows users should not attempt to download these files with a web browser.
References: Add references at CitEc
Citations:
Downloads: (external link)
http://fmwww.bc.edu/repec/bocode/c/chunky.ado program code (text/plain)
http://fmwww.bc.edu/repec/bocode/c/chunky.hlp help file (text/plain)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:boc:bocode:s456994
Ordering information: This software item can be ordered from
http://repec.org/docs/ssc.php
Access Statistics for this software item
More software in Statistical Software Components from Boston College Department of Economics Boston College, 140 Commonwealth Avenue, Chestnut Hill MA 02467 USA. Contact information at EDIRC.
Bibliographic data for series maintained by Christopher F Baum ().