Data Workflows with Stata and Python
Stephen Childs and
Dejan Pavlic
Additional contact information
Dejan Pavlic: Education Policy Research Initiative
2015 Stata Conference from Stata Users Group
Abstract:
Python is a general purpose programming language with an large library of packages that extend into domains that Stata does not touch. This talk will identify the key packages from Python that will allow it to work with Stata, primarily the pandas framework. Pandas is a relatively new, but extremely powerful, package for data preparation and analysis that works will with Stata - including support for categorical variables. This talk will discuss some new tools that have been developed to make it easier to connect Stata to Python. We will also discuss using Stata with the IPython Notebook, a tool that allows researchers to combine code and text in an easy to access document. During their work with the Education Policy Research Initiative, the authors have successfully transitioned much complex data preparation from Stata to Python, while still supporting Stata's powerful analytical tools. This talk is ideal for those interested in incorporating some Python into their workflow or planning a larger transition.
Date: 2015-08-02
References: Add references at CitEc
Citations:
Downloads: (external link)
http://repec.org/col2015/columbus15_childs.pdf
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:boc:scon15:14
Access Statistics for this paper
More papers in 2015 Stata Conference from Stata Users Group Contact information at EDIRC.
Bibliographic data for series maintained by Christopher F Baum ().