Industry

Business & Industry Applications, Machine Learning, Finance, Public Health & Bioinformatics, Government, Higher Education Research & Teaching

Language

R, C

Features

Data Wrangling, Data Import and Export, Data Aggregation, High Performance Computing, Big Data, Statistical Computing, Numerical Computing

data.table is an open source R package for data manipulation and analysis, built on C code with optimized algorithms for computing speed and memory efficiency, and with zero external dependencies. data.table is designed for performance, allowing users to perform wrangling tasks on large datasets entirely in-memory with concise and expressive R syntax.
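
As a rough illustration of that syntax, the sketch below filters, aggregates, and groups a small table in a single expression using the general form `DT[i, j, by]`. The `sales` table, its column names, and its values are hypothetical and exist only for this example.

```r
library(data.table)

# Hypothetical toy data, invented purely for illustration
sales <- data.table(
  region = c("north", "south", "north", "south", "north"),
  units  = c(10L, 4L, 7L, 12L, 3L),
  price  = c(2.5, 3.0, 2.5, 3.0, 2.5)
)

# Add a column by reference with := (the table is modified in place, not copied)
sales[, revenue := units * price]

# DT[i, j, by]: keep rows with units > 3, compute totals in j, grouped by region
summary_by_region <- sales[
  units > 3L,
  .(total_units = sum(units), total_revenue = sum(revenue)),
  by = region
]

print(summary_by_region)
```

The same pattern applies unchanged to much larger in-memory tables; for files on disk, `fread()` and `fwrite()` are the package's import and export functions.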

The performance and scalability of data.table make it a staple across many fields of industry and academia. It is widely adopted wherever handling large datasets is critical, such as in public health (e.g. Scotland’s NHS), at technology giants (e.g. Google and Amazon), and in machine learning (e.g. H2O.ai). It is also a favored choice in classrooms and research labs for its efficient and concise data workflows.
