Thursday, June 30 • 11:35am - 11:40am
Handling Huge Hierarchical Data in R

In this talk we present an R package for handling huge hierarchical statistical data. Hierarchical means that entities can contain datasets that describe or reference subentities on several levels. By huge we mean data far larger than main memory. The data is stored in a MySQL database, which allows simultaneous access by multiple users; however, no SQL knowledge is needed to access it. The framework is flexible enough to store any kind of data without modifying the database structure. Appropriate data views can be read and updated directly as data frames. Our approach handles references, access rights, and graph structures in the data. It uses an object-oriented framework that supports data classes and inheritance.

Moderators
Max

principal software engineer, Posit PBC
Max Kuhn is a software engineer at Posit PBC where he is working on improving R’s modeling capabilities and maintaining about 30 packages, including caret and tidymodels. He has a Ph.D. in Biostatistics. Max was a Senior Director of Nonclinical Statistics at Pfizer Global R&D and...

Speakers
Stephan Matos Camacho

Research fellow, Helmholtz Institute Freiberg for Resource Technology


Thursday June 30, 2016 11:35am - 11:40am PDT
Econ 140