data.table - Extension of `data.frame`
Fast aggregation of large data (e.g. 100GB in RAM), fast ordered joins, fast add/modify/delete of columns by group using no copies at all, list columns, friendly and fast character-separated-value read/write. Offers a natural and flexible syntax, for faster development.
Last updated 8 hours ago
23.27 score 3.6k stars 4.5k dependents 226k scripts 605k downloadssf - Simple Features for R
Support for simple feature access, a standardized way to encode and analyze spatial vector data. Binds to 'GDAL' <doi: 10.5281/zenodo.5884351> for reading and writing data, to 'GEOS' <doi: 10.5281/zenodo.11396894> for geometrical operations, and to 'PROJ' <doi: 10.5281/zenodo.5884394> for projection conversions and datum transformations. Uses by default the 's2' package for geometry operations on geodetic (long/lat degree) coordinates.
Last updated 3 days ago
gdalgeosprojspatialcpp
22.25 score 1.4k stars 1.1k dependents 110k scripts 361k downloadsjsonlite - A Simple and Robust JSON Parser and Generator for R
A reasonably fast JSON parser and generator, optimized for statistical data and the web. Offers simple, flexible tools for working with JSON in R, and is particularly powerful for building pipelines and interacting with a web API. The implementation is based on the mapping described in the vignette (Ooms, 2014). In addition to converting JSON data from/to R objects, 'jsonlite' contains functions to stream, validate, and prettify JSON data. The unit tests included with the package verify that all edge cases are encoded and decoded consistently for use with dynamic data in systems and applications.
Last updated 4 months ago
jsonparser
20.88 score 382 stars 8.3k dependents 25k scripts 1.1M downloadsDBI - R Database Interface
A database interface definition for communication between R and relational database management systems. All classes in this package are virtual and need to be extended by the various R/DBMS implementations.
Last updated 2 months ago
databaseinterface
20.81 score 300 stars 2.8k dependents 20k scripts 572k downloadstidyverse - Easily Install and Load the 'Tidyverse'
The 'tidyverse' is a set of packages that work in harmony because they share common data representations and 'API' design. This package is designed to make it easy to install and load multiple 'tidyverse' packages in a single step. Learn more about the 'tidyverse' at <https://www.tidyverse.org>.
Last updated 3 months ago
data-sciencetidyverse
20.10 score 1.7k stars 122 dependents 647k scripts 897k downloadscurl - A Modern and Flexible Web Client for R
Bindings to 'libcurl' <https://curl.se/libcurl/> for performing fully configurable HTTP/FTP requests where responses can be processed in memory, on disk, or streaming via the callback or connection interfaces. Some knowledge of 'libcurl' is recommended; for a more-user-friendly web client see the 'httr2' package which builds on this package with http specific tools and logic.
Last updated 1 hours ago
curl
19.82 score 224 stars 5.7k dependents 3.9k scripts 917k downloadsscales - Scale Functions for Visualization
Graphical scales map data to aesthetics, and provide methods for automatically determining breaks and labels for axes and legends.
Last updated 3 months ago
ggplot2
19.75 score 412 stars 7.8k dependents 90k scripts 828k downloadsdevtools - Tools to Make Developing R Packages Easier
Collection of package development tools.
Last updated 4 months ago
package-creation
19.62 score 2.4k stars 147 dependents 51k scripts 255k downloadspatchwork - The Composer of Plots
The 'ggplot2' package provides a strong API for sequentially building up a plot, but does not concern itself with composition of multiple plots. 'patchwork' is a package that expands the API to allow for arbitrarily complex composition of plots by, among others, providing mathematical operators for combining multiple plots. Other packages that try to address this need (but with a different approach) are 'gridExtra' and 'cowplot'.
Last updated 4 months ago
ggplot-extensionggplot2visualization
19.51 score 2.5k stars 618 dependents 80k scripts 148k downloadsDT - A Wrapper of the JavaScript Library 'DataTables'
Data objects in R can be rendered as HTML tables using the JavaScript library 'DataTables' (typically via R Markdown or Shiny). The 'DataTables' library has been included in this R package. The package name 'DT' is an abbreviation of 'DataTables'.
Last updated 2 months ago
datatableshtmlwidgetsjavascriptshiny
19.34 score 603 stars 651 dependents 38k scripts 353k downloadsjanitor - Simple Tools for Examining and Cleaning Dirty Data
The main janitor functions can: perfectly format data.frame column names; provide quick counts of variable combinations (i.e., frequency tables and crosstabs); and explore duplicate records. Other janitor functions nicely format the tabulation results. These tabulate-and-report functions approximate popular features of SPSS and Microsoft Excel. This package follows the principles of the "tidyverse" and works well with the pipe function %>%. janitor was built with beginning-to-intermediate R users in mind and is optimized for user-friendliness.
Last updated 30 days ago
data-analysisdata-cleaningdata-sciencedirty-dataexcelpivot-tablesspsstabulationstidyverse
19.07 score 1.4k stars 226 dependents 34k scripts 114k downloadsopenxlsx - Read, Write and Edit xlsx Files
Simplifies the creation of Excel .xlsx files by providing a high level interface to writing, styling and editing worksheets. Through the use of 'Rcpp', read/write times are comparable to the 'xlsx' and 'XLConnect' packages with the added benefit of removing the dependency on Java.
Last updated 7 days ago
xlsxcpp
18.97 score 230 stars 260 dependents 20k scripts 363k downloadsRSQLite - SQLite Interface for R
Embeds the SQLite database engine in R and provides an interface compliant with the DBI package. The source for the SQLite engine and for various extensions in a recent version is included. System libraries will never be consulted because this package relies on static linking for the plugins it includes; this also ensures a consistent experience across all installations.
Last updated 2 months ago
databasesqlite3cpp
18.72 score 327 stars 1.1k dependents 8.0k scripts 129k downloadsgt - Easily Create Presentation-Ready Display Tables
Build display tables from tabular data with an easy-to-use set of functions. With its progressive approach, we can construct display tables with a cohesive set of table parts. Table values can be formatted using any of the included formatting functions. Footnotes and cell styles can be precisely added through a location targeting system. The way in which 'gt' handles things for you means that you don't often have to worry about the fine details.
Last updated 2 hours ago
docxeasy-to-usehtmllatexrtfsummary-tables
18.34 score 2.1k stars 107 dependents 19k scripts 256k downloadsplotly - Create Interactive Web Graphics via 'plotly.js'
Create interactive web graphics from 'ggplot2' graphs and/or a custom interface to the (MIT-licensed) JavaScript library 'plotly.js' inspired by the grammar of graphics.
Last updated 1 months ago
d3jsdata-visualizationggplot2javascriptplotlyshinywebgl
18.25 score 2.6k stars 772 dependents 77k scripts 165k downloadstidyselect - Select from a Set of Strings
A backend for the selecting functions of the 'tidyverse'. It makes it easy to implement select-like functions in your own packages in a way that is consistent with other 'tidyverse' interfaces for selection.
Last updated 2 months ago
18.20 score 129 stars 8.1k dependents 1.9k scripts 902k downloadsfuture - Unified Parallel and Distributed Processing in R for Everyone
The purpose of this package is to provide a lightweight and unified Future API for sequential and parallel processing of R expression via futures. The simplest way to evaluate an expression in parallel is to use `x %<-% { expression }` with `plan(multisession)`. This package implements sequential, multicore, multisession, and cluster futures. With these, R expressions can be evaluated on the local machine, in parallel a set of local machines, or distributed on a mix of local and remote machines. Extensions to this package implement additional backends for processing futures via compute cluster schedulers, etc. Because of its unified API, there is no need to modify any code in order switch from sequential on the local machine to, say, distributed processing on a remote compute cluster. Another strength of this package is that global variables and functions are automatically identified and exported as needed, making it straightforward to tweak existing code to make use of futures.
Last updated 20 days ago
asynchronousdistributed-computingfutureshpchpc-clustersparallel-computingparallel-processingparallelizationprogrammingpromises
18.11 score 961 stars 1.2k dependents 15k scripts 202k downloadstinytex - Helper Functions to Install and Maintain TeX Live, and Compile LaTeX Documents
Helper functions to install and maintain the 'LaTeX' distribution named 'TinyTeX' (<https://yihui.org/tinytex/>), a lightweight, cross-platform, portable, and easy-to-maintain version of 'TeX Live'. This package also contains helper functions to compile 'LaTeX' documents, and install missing 'LaTeX' packages automatically.
Last updated 2 months ago
latextexlive
17.62 score 996 stars 3.6k dependents 2.3k scripts 766k downloadshttr2 - Perform HTTP Requests and Process the Responses
Tools for creating and modifying HTTP requests, then performing them and processing the results. 'httr2' is a modern re-imagining of 'httr' that uses a pipe-based interface and solves more of the problems that API wrapping packages face.
Last updated 1 days ago
http
17.48 score 242 stars 1.0k dependents 1.6k scripts 375k downloadsmagick - Advanced Graphics and Image-Processing in R
Bindings to 'ImageMagick': the most comprehensive open-source image processing library available. Supports many common formats (png, jpeg, tiff, pdf, etc) and manipulations (rotate, scale, crop, trim, flip, blur, etc). All operations are vectorized via the Magick++ STL meaning they operate either on a single frame or a series of frames for working with layers, collages, or animation. In RStudio images are automatically previewed when printed to the console, resulting in an interactive editing environment. The latest version of the package includes a native graphics device for creating in-memory graphics or drawing onto images using pixel coordinates.
Last updated 4 months ago
image-manipulationimage-processingimagemagickcpp
17.30 score 466 stars 245 dependents 8.7k scripts 64k downloadsgridExtra - Miscellaneous Functions for "Grid" Graphics
Provides a number of user-level functions to work with "grid" graphics, notably to arrange multiple grid-based plots on a page, and draw tables.
Last updated 4 years ago
17.25 score 15 stars 2.5k dependents 87k scripts 301k downloadsremotes - R Package Installation from Remote Repositories, Including 'GitHub'
Download and install R packages stored in 'GitHub', 'GitLab', 'Bitbucket', 'Bioconductor', or plain 'subversion' or 'git' repositories. This package provides the 'install_*' functions in 'devtools'. Indeed most of the code was copied over from 'devtools'.
Last updated 13 days ago
17.05 score 339 stars 244 dependents 4.9k scripts 432k downloadsflextable - Functions for Tabular Reporting
Use a grammar for creating and customizing pretty tables. The following formats are supported: 'HTML', 'PDF', 'RTF', 'Microsoft Word', 'Microsoft PowerPoint' and R 'Grid Graphics'. 'R Markdown', 'Quarto' and the package 'officer' can be used to produce the result files. The syntax is the same for the user regardless of the type of output to be produced. A set of functions allows the creation, definition of cell arrangement, addition of headers or footers, formatting and definition of cell content with text and or images. The package also offers a set of high-level functions that allow tabular reporting of statistical models and the creation of complex cross tabulations.
Last updated 2 months ago
docxhtml5ms-office-documentsrmarkdowntable
16.90 score 571 stars 116 dependents 7.3k scripts 45k downloadsgtsummary - Presentation-Ready Data Summary and Analytic Result Tables
Creates presentation-ready tables summarizing data sets, regression models, and more. The code to create the tables is concise and highly customizable. Data frames can be summarized with any function, e.g. mean(), median(), even user-written functions. Regression models are summarized and include the reference rows for categorical variables. Common regression models, such as logistic regression and Cox proportional hazards regression, are automatically identified and the tables are pre-filled with appropriate column headers.
Last updated 5 days ago
easy-to-usegthtml5regression-modelsreproducibilityreproducible-researchstatisticssummary-statisticssummary-tablestable1tableone
16.61 score 1.1k stars 13 dependents 7.9k scripts 30k downloadscollapse - Advanced and Fast Data Transformation
A C/C++ based package for advanced data transformation and statistical computing in R that is extremely fast, class-agnostic, robust and programmer friendly. Core functionality includes a rich set of S3 generic grouped and weighted statistical functions for vectors, matrices and data frames, which provide efficient low-level vectorizations, OpenMP multithreading, and skip missing values by default. These are integrated with fast grouping and ordering algorithms (also callable from C), and efficient data manipulation functions. The package also provides a flexible and rigorous approach to time series and panel data in R. It further includes fast functions for common statistical procedures, detailed (grouped, weighted) summary statistics, powerful tools to work with nested data, fast data object conversions, functions for memory efficient R programming, and helpers to effectively deal with variable labels, attributes, and missing data. It is well integrated with base R classes, 'dplyr'/'tibble', 'data.table', 'sf', 'units', 'plm' (panel-series and data frames), and 'xts'/'zoo'.
Last updated 13 days ago
data-aggregationdata-analysisdata-manipulationdata-processingdata-sciencedata-transformationeconometricshigh-performancepanel-datascientific-computingstatisticstime-seriesweightedweightscppopenmp
16.48 score 670 stars 93 dependents 768 scripts 23k downloadsstyler - Non-Invasive Pretty Printing of R Code
Pretty-prints R code without changing the user's formatting intent.
Last updated 1 months ago
pretty-print
16.28 score 742 stars 60 dependents 980 scripts 55k downloadsskimr - Compact and Flexible Summaries of Data
A simple to use summary function that can be used with pipes and displays nicely in the console. The default summary statistics may be modified by the user as can the default formatting. Support for data frames and vectors is included, and users can implement their own skim methods for specific object types as described in a vignette. Default summaries include support for inline spark graphs. Instructions for managing these on specific operating systems are given in the "Using skimr" vignette and the README.
Last updated 2 years ago
peer-reviewedropenscisummary-statisticsunconfunconf17
16.24 score 1.1k stars 13 dependents 15k scripts 37k downloadsggthemes - Extra Themes, Scales and Geoms for 'ggplot2'
Some extra themes, geoms, and scales for 'ggplot2'. Provides 'ggplot2' themes and scales that replicate the look of plots by Edward Tufte, Stephen Few, 'Fivethirtyeight', 'The Economist', 'Stata', 'Excel', and 'The Wall Street Journal', among others. Provides 'geoms' for Tufte's box plot and range frame.
Last updated 11 months ago
data-visualisationggplot2ggplot2-themesplotplottingthemevisualization
16.23 score 1.3k stars 97 dependents 39k scripts 43k downloadsodbc - Connect to ODBC Compatible Databases (using the DBI Interface)
A DBI-compatible interface to ODBC databases.
Last updated 22 hours ago
databaseodbcunixodbccpp
16.12 score 394 stars 22 dependents 3.0k scripts 49k downloadstidymodels - Easily Install and Load the 'Tidymodels' Packages
The tidy modeling "verse" is a collection of packages for modeling and statistical analysis that share the underlying design philosophy, grammar, and data structures of the tidyverse.
Last updated 3 months ago
16.11 score 774 stars 14 dependents 65k scripts 25k downloadsspatstat - Spatial Point Pattern Analysis, Model-Fitting, Simulation, Tests
Comprehensive open-source toolbox for analysing Spatial Point Patterns. Focused mainly on two-dimensional point patterns, including multitype/marked points, in any spatial region. Also supports three-dimensional point patterns, space-time point patterns in any number of dimensions, point patterns on a linear network, and patterns of other geometrical objects. Supports spatial covariate data such as pixel images. Contains over 3000 functions for plotting spatial data, exploratory data analysis, model-fitting, simulation, spatial sampling, model diagnostics, and formal inference. Data types include point patterns, line segment patterns, spatial windows, pixel images, tessellations, and linear networks. Exploratory methods include quadrat counts, K-functions and their simulation envelopes, nearest neighbour distance and empty space statistics, Fry plots, pair correlation function, kernel smoothed intensity, relative risk estimation with cross-validated bandwidth selection, mark correlation functions, segregation indices, mark dependence diagnostics, and kernel estimates of covariate effects. Formal hypothesis tests of random pattern (chi-squared, Kolmogorov-Smirnov, Monte Carlo, Diggle-Cressie-Loosmore-Ford, Dao-Genton, two-stage Monte Carlo) and tests for covariate effects (Cox-Berman-Waller-Lawson, Kolmogorov-Smirnov, ANOVA) are also supported. Parametric models can be fitted to point pattern data using the functions ppm(), kppm(), slrm(), dppm() similar to glm(). Types of models include Poisson, Gibbs and Cox point processes, Neyman-Scott cluster processes, and determinantal point processes. Models may involve dependence on covariates, inter-point interaction, cluster formation and dependence on marks. Models are fitted by maximum likelihood, logistic regression, minimum contrast, and composite likelihood methods. A model can be fitted to a list of point patterns (replicated point pattern data) using the function mppm(). The model can include random effects and fixed effects depending on the experimental design, in addition to all the features listed above. Fitted point process models can be simulated, automatically. Formal hypothesis tests of a fitted model are supported (likelihood ratio test, analysis of deviance, Monte Carlo tests) along with basic tools for model selection (stepwise(), AIC()) and variable selection (sdr). Tools for validating the fitted model include simulation envelopes, residuals, residual plots and Q-Q plots, leverage and influence diagnostics, partial residuals, and added variable plots.
Last updated 21 hours ago
cluster-processcox-point-processgibbs-processkernel-densitynetwork-analysispoint-processpoisson-processspatial-analysisspatial-dataspatial-data-analysisspatial-statisticsspatstatstatistical-methodsstatistical-modelsstatistical-testsstatistics
16.02 score 198 stars 41 dependents 5.4k scripts 12k downloadstmap - Thematic Maps
Thematic maps are geographical maps in which spatial data distributions are visualized. This package offers a flexible, layer-based, and easy to use approach to create thematic maps, such as choropleths and bubble maps.
Last updated 8 days ago
choropleth-mapsmapsspatialthematic-mapsvisualisation
15.81 score 877 stars 22 dependents 12k scripts 15k downloadssjPlot - Data Visualization for Statistics in Social Science
Collection of plotting and table output functions for data visualization. Results of various statistical analyses (that are commonly used in social sciences) can be visualized using this package, including simple and cross tabulated frequencies, histograms, box plots, (generalized) linear models, mixed effects models, principal component analysis and correlation matrices, cluster analyses, scatter plots, stacked scales, effects plots of regression models (including interaction terms) and much more. This package supports labelled data.
Last updated 2 months ago
data-visualizationplottingsocial-sciencesstatistics
15.77 score 617 stars 3 dependents 7.4k scripts 24k downloadsofficer - Manipulation of Microsoft Word and PowerPoint Documents
Access and manipulate 'Microsoft Word', 'RTF' and 'Microsoft PowerPoint' documents from R. The package focuses on tabular and graphical reporting from R; it also provides two functions that let users get document content into data objects. A set of functions lets add and remove images, tables and paragraphs of text in new or existing documents. The package does not require any installation of Microsoft products to be able to write Microsoft files.
Last updated 20 days ago
ms-office-documentspowerpointword
15.76 score 619 stars 135 dependents 4.1k scripts 58k downloadsparameters - Processing of Model Parameters
Utilities for processing the parameters of various statistical models. Beyond computing p values, CIs, and other indices for a wide variety of models (see list of supported models using the function 'insight::supported_models()'), this package implements features like bootstrapping or simulating of parameters and models, feature reduction (feature extraction and variable selection) as well as functions to describe data and variable characteristics (e.g. skewness, kurtosis, smoothness or distribution).
Last updated 8 days ago
betabootstrapciconfidence-intervalsdata-reductioneasystatsfafeature-extractionfeature-reductionhacktoberfestparameterspcapvaluesregression-modelsrobust-statisticsstandardizestandardized-estimatesstatistical-models
15.37 score 448 stars 54 dependents 1.8k scripts 47k downloadsstringdist - Approximate String Matching, Fuzzy Text Search, and String Distance Functions
Implements an approximate string matching version of R's native 'match' function. Also offers fuzzy text search based on various string distance measures. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well. Reference: MPJ van der Loo (2014) <doi:10.32614/RJ-2014-011>.
Last updated 1 months ago
openmp
15.27 score 324 stars 153 dependents 2.0k scripts 42k downloadszip - Cross-Platform 'zip' Compression
Cross-Platform 'zip' Compression Library. A replacement for the 'zip' function, that does not require any additional external tools on any platform.
Last updated 16 days ago
15.25 score 84 stars 855 dependents 924 scripts 362k downloadsformatR - Format R Code Automatically
Provides a function tidy_source() to format R source code. Spaces and indent will be added to the code automatically, and comments will be preserved under certain conditions, so that R code will be more human-readable and tidy. There is also a Shiny app as a user interface in this package (see tidy_app()).
Last updated 1 years ago
code-formatter
15.10 score 257 stars 1.2k dependents 1.6k scripts 69k downloadsggstatsplot - 'ggplot2' Based Plots with Statistical Details
Extension of 'ggplot2', 'ggstatsplot' creates graphics with details from statistical tests included in the plots themselves. It provides an easier syntax to generate information-rich plots for statistical analysis of continuous (violin plots, scatterplots, histograms, dot plots, dot-and-whisker plots) or categorical (pie and bar charts) data. Currently, it supports the most common types of statistical approaches and tests: parametric, nonparametric, robust, and Bayesian versions of t-test/ANOVA, correlation analyses, contingency table analysis, meta-analysis, and regression analyses. References: Patil (2021) <doi:10.21105/joss.03236>.
Last updated 1 months ago
bayes-factorsdatasciencedatavizeffect-sizeggplot-extensionhypothesis-testingnon-parametric-statisticsregression-modelsstatistical-analysis
14.70 score 2.1k stars 1 dependents 2.7k scripts 16k downloadsRPostgres - C++ Interface to PostgreSQL
Fully DBI-compliant C++-backed interface to PostgreSQL <https://www.postgresql.org/>, an open-source relational database.
Last updated 1 months ago
databasepostgrespostgresqlcpp
14.68 score 335 stars 29 dependents 1.6k scripts 29k downloadsRColorBrewer - ColorBrewer Palettes
Provides color schemes for maps (and other graphics) designed by Cynthia Brewer as described at http://colorbrewer2.org.
Last updated 3 years ago
14.21 score 8 stars 8.0k dependents 109k scripts 770k downloadspacman - Package Management Tool
Tools to more conveniently perform tasks associated with add-on packages. pacman conveniently wraps library and package related functions and names them in an intuitive and consistent fashion. It seeks to combine functionality from lower level functions which can speed up workflow.
Last updated 5 years ago
githubpackage-managementpackages
14.14 score 314 stars 10 dependents 13k scripts 247k downloadsmapview - Interactive Viewing of Spatial Data in R
Quickly and conveniently create interactive visualisations of spatial data with or without background maps. Attributes of displayed features are fully queryable via pop-up windows. Additional functionality includes methods to visualise true- and false-color raster images and bounding boxes.
Last updated 1 months ago
gisleafletmapsspatialvisualizationweb-mapping
14.10 score 522 stars 25 dependents 7.6k scripts 11k downloadsextrafont - Tools for Using Fonts
Tools to using fonts other than the standard PostScript fonts. This package makes it easy to use system TrueType fonts and with PDF or PostScript output files, and with bitmap output files in Windows. extrafont can also be used with fonts packaged specifically to be used with, such as the fontcm package, which has Computer Modern PostScript fonts with math symbols.
Last updated 2 years ago
13.99 score 318 stars 51 dependents 13k scripts 32k downloadspaletteer - Comprehensive Collection of Color Palettes
The choices of color palettes in R can be quite overwhelming with palettes spread over many packages with many different API's. This packages aims to collect all color palettes across the R ecosystem under the same package with a streamlined API.
Last updated 7 months ago
color-palettepalettes
13.90 score 947 stars 22 dependents 6.8k scripts 158k downloadstidycensus - Load US Census Boundary and Attribute Data as 'tidyverse' and 'sf'-Ready Data Frames
An integrated R interface to several United States Census Bureau APIs (<https://www.census.gov/data/developers/data-sets.html>) and the US Census Bureau's geographic boundary files. Allows R users to return Census and ACS data as tidyverse-ready data frames, and optionally returns a list-column with feature geometry for mapping and spatial analysis.
Last updated 3 months ago
13.85 score 643 stars 9 dependents 7.5k scripts 5.3k downloadshrbrthemes - Additional Themes, Theme Components and Utilities for 'ggplot2'
A compilation of extra 'ggplot2' themes, scales and utilities, including a spell check function for plot label fields and an overall emphasis on typography. A copy of the 'Google' font 'Roboto Condensed' is also included.
Last updated 7 days ago
data-visualizationdatavisualizationggplot-extensionggplot2ggplot2-scalesggplot2-themesvisualization
13.64 score 1.3k stars 15 dependents 13k scripts 11k downloadsduckdb - DBI Package for the DuckDB Database Management System
The DuckDB project is an embedded analytical data management system with support for the Structured Query Language (SQL). This package includes all of DuckDB and an R Database Interface (DBI) connector.
Last updated 4 hours ago
databaseduckdbolapcpp
13.37 score 147 stars 46 dependents 1.5k scripts 18k downloadsopenxlsx2 - Read, Write and Edit 'xlsx' Files
Simplifies the creation of 'xlsx' files by providing a high level interface to writing, styling and editing worksheets.
Last updated 7 days ago
xlsxcpp
13.35 score 130 stars 9 dependents 210 scripts 4.5k downloadstigris - Load Census TIGER/Line Shapefiles
Download TIGER/Line shapefiles from the United States Census Bureau (<https://www.census.gov/geographies/mapping-files/time-series/geo/tiger-line-file.html>) and load into R as 'sf' objects.
Last updated 3 months ago
13.04 score 325 stars 15 dependents 5.3k scripts 9.8k downloadsesquisse - Explore and Visualize Your Data Interactively
A 'shiny' gadget to create 'ggplot2' figures interactively with drag-and-drop to map your variables to different aesthetics. You can quickly visualize your data accordingly to their type, export in various formats, and retrieve the code to reproduce the plot.
Last updated 13 days ago
addindata-visualizationggplot2rstudio-addinvisualization
12.98 score 1.8k stars 1 dependents 1.1k scripts 5.0k downloadsshowtext - Using Fonts More Easily in R Graphs
Making it easy to use various types of fonts ('TrueType', 'OpenType', Type 1, web fonts, etc.) in R graphs, and supporting most output formats of R graphics including PNG, PDF and SVG. Text glyphs will be converted into polygons or raster images, hence after the plot has been created, it no longer relies on the font files. No external software such as 'Ghostscript' is needed to use this package.
Last updated 11 months ago
fontgraphicsgraphics-devicer-graphicsfreetype
12.84 score 483 stars 35 dependents 12k scripts 23k downloadspdftools - Text Extraction, Rendering and Converting of PDF Documents
Utilities based on 'libpoppler' <https://poppler.freedesktop.org> for extracting text, fonts, attachments and metadata from a PDF file. Also supports high quality rendering of PDF documents into PNG, JPEG, TIFF format, or into raw bitmap vectors for further processing in R.
Last updated 4 months ago
pdf-filespdf-formatpdftoolspopplerpoppler-librarytext-extractioncpp
12.83 score 527 stars 44 dependents 3.3k scripts 24k downloadspak - Another Approach to Package Installation
The goal of 'pak' is to make package installation faster and more reliable. In particular, it performs all HTTP operations in parallel, so metadata resolution and package downloads are fast. Metadata and package files are cached on the local disk as well. 'pak' has a dependency solver, so it finds version conflicts before performing the installation. This version of 'pak' supports CRAN, 'Bioconductor' and 'GitHub' packages as well.
Last updated 13 days ago
12.72 score 699 stars 16 dependents 209 scripts 16k downloadsstplanr - Sustainable Transport Planning
Tools for transport planning with an emphasis on spatial transport data and non-motorized modes. The package was originally developed to support the 'Propensity to Cycle Tool', a publicly available strategic cycle network planning tool (Lovelace et al. 2017) <doi:10.5198/jtlu.2016.862>, but has since been extended to support public transport routing and accessibility analysis (Moreno-Monroy et al. 2017) <doi:10.1016/j.jtrangeo.2017.08.012> and routing with locally hosted routing engines such as 'OSRM' (Lowans et al. 2023) <doi:10.1016/j.enconman.2023.117337>. The main functions are for creating and manipulating geographic "desire lines" from origin-destination (OD) data (building on the 'od' package); calculating routes on the transport network locally and via interfaces to routing services such as <https://cyclestreets.net/> (Desjardins et al. 2021) <doi:10.1007/s11116-021-10197-1>; and calculating route segment attributes such as bearing. The package implements the 'travel flow aggregration' method described in Morgan and Lovelace (2020) <doi:10.1177/2399808320942779> and the 'OD jittering' method described in Lovelace et al. (2022) <doi:10.32866/001c.33873>. Further information on the package's aim and scope can be found in the vignettes and in a paper in the R Journal (Lovelace and Ellison 2018) <doi:10.32614/RJ-2018-053>, and in a paper outlining the landscape of open source software for geographic methods in transport planning (Lovelace, 2021) <doi:10.1007/s10109-020-00342-2>.
Last updated 5 months ago
cyclecyclingdesire-linesorigin-destinationpeer-reviewedpubic-transportroute-networkroutesroutingspatialtransporttransport-planningtransportationwalking
12.02 score 424 stars 2 dependents 676 scripts 1.4k downloadspolars - Lightning-Fast 'DataFrame' Library
Lightning-fast 'DataFrame' library written in 'Rust'. Convert R data to 'Polars' data and vice versa. Perform fast, lazy, larger-than-memory and optimized data queries. 'Polars' is interoperable with the package 'arrow', as both are based on the 'Apache Arrow' Columnar Format.
Last updated 4 days ago
arrowpolarsrust
12.00 score 489 stars 2 dependents 1.1k scripts 10 downloadsgtExtras - Extending 'gt' for Beautiful HTML Tables
Provides additional functions for creating beautiful tables with 'gt'. The functions are generally wrappers around boilerplate or adding opinionated niche capabilities and helpers functions.
Last updated 10 months ago
data-sciencedata-visualizationdatascienceggplot2gtplotssparklinesparkline-graphssparklinestables
11.53 score 198 stars 2 dependents 2.4k scripts 147k downloadsdlookr - Tools for Data Diagnosis, Exploration, Transformation
A collection of tools that support data diagnosis, exploration, and transformation. Data diagnostics provides information and visualization of missing values, outliers, and unique and negative values to help you understand the distribution and quality of your data. Data exploration provides information and visualization of the descriptive statistics of univariate variables, normality tests and outliers, correlation of two variables, and the relationship between the target variable and predictor. Data transformation supports binning for categorizing continuous variables, imputes missing values and outliers, and resolves skewness. And it creates automated reports that support these three tasks.
Last updated 8 months ago
11.52 score 211 stars 2 dependents 744 scripts 2.5k downloadsrayshader - Create Maps and Visualize Data in 2D and 3D
Uses a combination of raytracing and multiple hill shading methods to produce 2D and 3D data visualizations and maps. Includes water detection and layering functions, programmable color palette generation, several built-in textures for hill shading, 2D and 3D plotting options, a built-in path tracer, 'Wavefront' OBJ file export, and the ability to save 3D visualizations to a 3D printable format.
Last updated 16 days ago
cpp
11.48 score 2.1k stars 5 dependents 1.5k scripts 2.0k downloadsforeign - Read Data Stored by 'Minitab', 'S', 'SAS', 'SPSS', 'Stata', 'Systat', 'Weka', 'dBase', ...
Reading and writing data stored by some versions of 'Epi Info', 'Minitab', 'S', 'SAS', 'SPSS', 'Stata', 'Systat', 'Weka', and for reading and writing some 'dBase' files.
Last updated 10 days ago
11.36 score 6 stars 889 dependents 15k scripts 48k downloadsarchive - Multi-Format Archive and Compression Support
Bindings to 'libarchive' <http://www.libarchive.org> the Multi-format archive and compression library. Offers R connections and direct extraction for many archive formats including 'tar', 'ZIP', '7-zip', 'RAR', 'CAB' and compression formats including 'gzip', 'bzip2', 'compress', 'lzma' and 'xz'.
Last updated 3 months ago
compressionconnectionslibarchivecpp
11.33 score 145 stars 25 dependents 494 scripts 6.6k downloadstmaptools - Thematic Map Tools
Set of tools for reading and processing spatial data. The aim is to supply the workflow to create thematic maps. This package also facilitates 'tmap', the package for visualizing thematic maps.
Last updated 9 days ago
11.29 score 42 stars 24 dependents 1.8k scripts 12k downloadstidytable - Tidy Interface to 'data.table'
A tidy interface to 'data.table', giving users the speed of 'data.table' while using tidyverse-like syntax.
Last updated 1 days ago
11.21 score 455 stars 10 dependents 696 scripts 2.2k downloadsmapsf - Thematic Cartography
Create and integrate thematic maps in your workflow. This package helps to design various cartographic representations such as proportional symbols, choropleth or typology maps. It also offers several functions to display layout elements that improve the graphic presentation of maps (e.g. scale bar, north arrow, title, labels). 'mapsf' maps 'sf' objects on 'base' graphics.
Last updated 9 days ago
cartographymapspatialspatial-analysis
11.07 score 227 stars 9 dependents 400 scripts 1.9k downloadssfheaders - Converts Between R Objects and Simple Feature Objects
Converts between R and Simple Feature 'sf' objects, without depending on the Simple Feature library. Conversion functions are available at both the R level, and through 'Rcpp'.
Last updated 6 months ago
rcppsimple-featurescpp
11.03 score 74 stars 146 dependents 384 scripts 29k downloadsipumsr - An R Interface for Downloading, Reading, and Handling IPUMS Data
An easy way to work with census, survey, and geographic data provided by IPUMS in R. Generate and download data through the IPUMS API and load IPUMS files into R with their associated metadata to make analysis easier. IPUMS data describing 1.4 billion individuals drawn from over 750 censuses and surveys is available free of charge from the IPUMS website <https://www.ipums.org>.
Last updated 22 days ago
10.95 score 26 stars 2 dependents 820 scripts 2.4k downloadsstargazer - Well-Formatted Regression and Summary Statistics Tables
Produces LaTeX code, HTML/CSS code and ASCII text for well-formatted tables that hold regression analysis results from several models side-by-side, as well as summary statistics.
Last updated 3 years ago
10.84 score 35 stars 10 dependents 12k scripts 55k downloadstidytransit - Read, Validate, Analyze, and Map GTFS Feeds
Read General Transit Feed Specification (GTFS) zipfiles into a list of R dataframes. Perform validation of the data structure against the specification. Analyze the headways and frequencies at routes and stops. Create maps and perform spatial analysis on the routes and stops. Please see the GTFS documentation here for more detail: <https://gtfs.org/>.
Last updated 2 days ago
gtfspublicpublic-transporttidyversetransittransit-datatransporttransportation
10.48 score 151 stars 1 dependents 312 scripts 910 downloadsdatapasta - R Tools for Data Copy-Pasta
RStudio addins and R functions that make copy-pasting vectors and tables to text painless.
Last updated 3 years ago
addinclipboardcopypasteexceltibble
10.20 score 896 stars 2 dependents 285 scripts 1.0k downloadstictoc - Functions for Timing R Scripts, as Well as Implementations of "Stack" and "StackList" Structures
Code execution timing functions 'tic' and 'toc' that can be nested. One can record all timings while a complex script is running, and examine the values later. It is also possible to instrument the timing calls with custom callbacks. In addition, this package provides class 'Stack', implemented as a vector, and class 'StackList', which is a stack implemented as a list, both of which support operations 'push', 'pop', 'first_element', 'last_element' and 'clear'.
Last updated 10 months ago
10.17 score 7 stars 43 dependents 12k scripts 14k downloadsrvg - R Graphics Devices for 'Office' Vector Graphics Output
Vector Graphics devices for 'Microsoft PowerPoint' and 'Microsoft Excel'. Functions extending package 'officer' are provided to embed 'DrawingML' graphics into 'Microsoft PowerPoint' presentations and 'Microsoft Excel' workbooks.
Last updated 3 months ago
libpngcpp
9.87 score 131 stars 19 dependents 258 scripts 13k downloadscensusapi - Retrieve Data from the Census APIs
A wrapper for the U.S. Census Bureau APIs that returns data frames of Census data and metadata. Available datasets include the Decennial Census, American Community Survey, Small Area Health Insurance Estimates, Small Area Income and Poverty Estimates, Population Estimates and Projections, and more.
Last updated 10 months ago
censuscensus-apicensus-datademographicsopen-data
9.68 score 173 stars 6 dependents 704 scripts 1.1k downloadsr5r - Rapid Realistic Routing with 'R5'
Rapid realistic routing on multimodal transport networks (walk, bike, public transport and car) using 'R5', the Rapid Realistic Routing on Real-world and Reimagined networks engine <https://github.com/conveyal/r5>. The package allows users to generate detailed routing analysis or calculate travel time and monetary cost matrices using seamless parallel computing on top of the R5 Java machine. While R5 is developed by Conveyal, the package r5r is independently developed by a team at the Institute for Applied Economic Research (Ipea) with contributions from collaborators. Apart from the documentation in this package, users will find additional information on R5 documentation at <https://docs.conveyal.com/>. Although we try to keep new releases of r5r in synchrony with R5, the development of R5 follows Conveyal's independent update process. Hence, users should confirm the R5 version implied by the Conveyal user manual (see <https://docs.conveyal.com/changelog>) corresponds with the R5 version that r5r depends on. This version of r5r depends on R5 v7.1.
Last updated 5 days ago
gtfsjavar5routerroutingtransporttransport-networksopenjdk
9.52 score 184 stars 444 scripts 621 downloadsduckplyr - A 'DuckDB'-Backed Version of 'dplyr'
A drop-in replacement for 'dplyr', powered by 'DuckDB' for performance. Offers convenient utilities for working with in-memory and larger-than-memory data while retaining full 'dplyr' compatibility.
Last updated 6 hours ago
analyticsdataframedplyrduckdbperformance
9.38 score 303 stars 192 scripts 744 downloadsgtfstools - General Transit Feed Specification (GTFS) Editing and Analysing Tools
Utility functions to read, manipulate, analyse and write transit feeds in the General Transit Feed Specification (GTFS) data format.
Last updated 1 days ago
gtfspublic-transportpublictransportcpp
9.31 score 40 stars 3 dependents 126 scripts 947 downloadsjmv - The 'jamovi' Analyses
A suite of common statistical methods such as descriptives, t-tests, ANOVAs, regression, correlation matrices, proportion tests, contingency tables, and factor analysis. This package is also useable from the 'jamovi' statistical spreadsheet (see <https://www.jamovi.org> for more information).
Last updated 2 days ago
9.23 score 59 stars 434 scripts 1.7k downloadscartogram - Create Cartograms with R
Construct continuous and non-contiguous area cartograms.
Last updated 9 days ago
9.00 score 149 stars 2 dependents 728 scripts 2.6k downloadsgtfsrouter - Routing with 'GTFS' (General Transit Feed Specification) Data
Use 'GTFS' (General Transit Feed Specification) data for routing from nominated start and end stations, for extracting 'isochrones', and travel times from any nominated start station to all other stations.
Last updated 3 months ago
gtfsgtfsrouterpublic-transportationroutercpp
8.89 score 85 stars 3 dependents 21 scripts 668 downloadspolite - Be Nice on the Web
Be responsible when scraping data from websites by following polite principles: introduce yourself, ask for permission, take slowly and never ask twice.
Last updated 1 years ago
crawlermemoiserate-limiterrobotstxtrvestscraperwebscraping
8.87 score 326 stars 4 dependents 600 scripts 1.3k downloadsnanoparquet - Read and Write 'Parquet' Files
Self-sufficient reader and writer for flat 'Parquet' files. Can read most 'Parquet' data types. Can write many 'R' data types, including factors and temporal types. See docs for limitations.
Last updated 5 days ago
parquetcpp
8.77 score 55 stars 8 dependents 85 scripts 2.4k downloadsareal - Areal Weighted Interpolation
A pipeable, transparent implementation of areal weighted interpolation with support for interpolating multiple variables in a single function call. These tools provide a full-featured workflow for validation and estimation that fits into both modern data management (e.g. tidyverse) and spatial data (e.g. sf) frameworks.
Last updated 3 years ago
8.76 score 92 stars 4 dependents 107 scripts 1.1k downloadsRSocrata - Download or Upload 'Socrata' Data Sets
Provides easier interaction with 'Socrata' open data portals <https://dev.socrata.com>. Users can provide a 'Socrata' data set resource URL, or a 'Socrata' Open Data API (SODA) web query, or a 'Socrata' "human-friendly" URL, returns an R data frame. Converts dates to 'POSIX' format and manages throttling by 'Socrata'. Users can upload data to 'Socrata' portals directly from R.
Last updated 1 years ago
chicagogovernmentopen-datasocratasoda
7.83 score 236 stars 792 scripts 1.2k downloadsopenrouteservice - An 'openrouteservice' API Client
The client streamlines access to the services provided by <https://api.openrouteservice.org>. It allows you to painlessly query for directions, isochrones, time-distance matrices, geocoding, elevation, points of interest, and more.
Last updated 3 months ago
apidirectionsgisgiscienceisochronesopenrouteserviceopenstreetmappoisroutingsdk
7.80 score 104 stars 60 scripts 358 downloadsdataMaid - A Suite of Checks for Identification of Potential Errors in a Data Frame as Part of the Data Screening Process
Data screening is an important first step of any statistical analysis. dataMaid auto generates a customizable data report with a thorough summary of the checks and the results that a human can use to identify possible errors. It provides an extendable suite of test for common potential errors in a dataset.
Last updated 3 years ago
data-cleaningdata-screeningreproducible-research
7.53 score 143 stars 236 scripts 622 downloadssfdep - Spatial Dependence for Simple Features
An interface to 'spdep' to integrate with 'sf' objects and the 'tidyverse'.
Last updated 5 months ago
r-spatialspatial
7.30 score 129 stars 130 scripts 641 downloadsggThemeAssist - Add-in to Customize 'ggplot2' Themes
Rstudio add-in that delivers a graphical interface for editing 'ggplot2' theme elements.
Last updated 6 years ago
7.27 score 490 stars 251 scripts 689 downloadsgtfsio - Read and Write General Transit Feed Specification (GTFS) Files
Tools for the development of packages related to General Transit Feed Specification (GTFS) files. Establishes a standard for representing GTFS feeds using R data types. Provides fast and flexible functions to read and write GTFS feeds while sticking to this standard. Defines a basic 'gtfs' class which is meant to be extended by packages that depend on it. And offers utility functions that support checking the structure of GTFS objects.
Last updated 1 days ago
gtfs
7.23 score 14 stars 5 dependents 13 scripts 1.0k downloadsmschart - Chart Generation for 'Microsoft Word' and 'Microsoft PowerPoint' Documents
Create native charts for 'Microsoft PowerPoint' and 'Microsoft Word' documents. These can then be edited and annotated. Functions are provided to let users create charts, modify and format their content. The chart's underlying data is automatically saved within the 'Word' document or 'PowerPoint' presentation. It extends package 'officer' that does not contain any feature for 'Microsoft' native charts production.
Last updated 9 months ago
ms-office-documentsofficepowerpointword
7.17 score 132 stars 1 dependents 106 scripts 1.2k downloadsGWalkR - Interactive Exploratory Data Analysis Tool
Simplify your R data analysis and data visualization workflow by turning your data frame into an interactive 'Tableau'-like interface, leveraging the 'graphic-walker' JavaScript library and the 'htmlwidgets' package.
Last updated 2 days ago
data-visualizationggplot2tableau-alternativevisualization
7.09 score 443 stars 46 scripts 731 downloadslehdr - Grab Longitudinal Employer-Household Dynamics (LEHD) Flat Files
Designed to query Longitudinal Employer-Household Dynamics (LEHD) workplace/residential association and origin-destination flat files and optionally aggregate Census block-level data to block group, tract, county, or state. Data comes from the LODES FTP server <https://lehd.ces.census.gov/data/lodes/LODES8/>.
Last updated 2 months ago
7.04 score 61 stars 90 scripts 381 downloadspandoc - Manage and Run Universal Converter 'Pandoc' from 'R'
Provides a set of tools to install, manage and run several 'Pandoc' versions.
Last updated 11 months ago
7.01 score 27 stars 3 dependents 183 scripts 4.6k downloadsannotater - Annotate Package Load Calls
Provides non-invasive annotation of package load calls such as \code{library()}, \code{p_load()}, and \code{require()} so that we can have an idea of what the packages we are loading are meant for.
Last updated 4 months ago
6.93 score 101 stars 21 scripts 239 downloadscransays - Creates an Overview of CRAN Incoming Submissions
It scrapes the CRAN incoming FTP folder to find where each submission is.
Last updated 3 days ago
6.49 score 81 starsbbplot - MAKING GGPLOT GRAPHICS IN BBC NEWS STYLE
Helps create and export ggplot2 charts in the style used by the BBC News data team thanks to two useful functions, one for styling and another for exporting ggplot graphics
Last updated 6 years ago
6.44 score 1.6k stars 354 scriptsdataReporter - Reproducible Data Screening Checks and Report of Possible Errors
Data screening is an important first step of any statistical analysis. 'dataReporter' auto generates a customizable data report with a thorough summary of the checks and the results that a human can use to identify possible errors. It provides an extendable suite of test for common potential errors in a dataset. See Petersen AH, Ekstrøm CT (2019). "dataMaid: Your Assistant for Documenting Supervised Data Quality Screening in R." _Journal of Statistical Software_, *90*(6), 1-38 <doi:10.18637/jss.v090.i06> for more information.
Last updated 2 years ago
6.16 score 86 stars 34 scripts 232 downloadsoverturemapsr - Download Overture Maps Data in R
Overture Maps offers free and open geospatial map data sourced from various providers and standardized to a common schema. This tool allows you to download Overture Maps data for a specific region of interest and convert it to several different file formats. For more information, visit <https://overturemaps.org/download/>.
Last updated 5 months ago
datageospatiallocationopendataosmosmdataoverturemaps
4.77 score 14 stars 14 scripts 149 downloadstidyqwi - A Convenient API for Accessing United States Census Bureau's Quarterly Workforce Indicator
The purpose of this package is to access the United States Census Bureau's Quarterly Workforce Indicator data. Additionally, the data will be retrieved in a tidy format for further manipulation with full variable descriptions added if desired. Information about the United States Census Bureau's Quarterly Workforce Indicator is available at <https://www.census.gov/data/developers/data-sets/qwi.html>.
Last updated 9 months ago
4.76 score 4 stars 36 scripts 47 downloadsntdr - Retrieve Data from the National Transit Database
Downloads the latest 'National Transit Database' data, processes it, and returns in a tidy data format.
Last updated 3 months ago
4.48 score 3 stars 7 scripts 500 downloadsglitter - glitter makes SPARQL
This package aims at writing and sending SPARQL queries. It makes the exploration and use of Linked Open Data (Wikidata in particular) easier for those who do not know SPARQL.
Last updated 1 years ago
rstatsparql
4.11 score 46 stars 28 scriptsarcgis - ArcGIS Location Services Meta-Package
Provides easy installation and loading of core ArcGIS location services packages 'arcgislayers', 'arcgisutils', 'arcgisgeocode', and 'arcgisplaces'. Enabling developers to interact with spatial data and services from 'ArcGIS Online', 'ArcGIS Enterprise', and 'ArcGIS Platform'. Learn more about the 'arcgis' meta-package at <https://r.esri.com/r-bridge-site/>.
Last updated 7 months ago
3.83 score 10 stars 45 scripts 21 downloadsroblog - rOpenSci's blog guidance
It provides templates for roweb2 blogging and help for a GitHub forking workflow.
Last updated 1 months ago
3.40 score 5 stars 1 scriptsCTPPr - Census Transportation Planning Products Program (CTPP) API
Download Census Transportation Planning Products Program (CTPP) Tables for analysis in R
Last updated 6 years ago
2.85 score 14 stars 3 scriptsRArcInfo - Functions to import data from Arc/Info V7.x binary coverages
This package uses the functions written by Daniel Morissette <[email protected]> to read geographical information in Arc/Info V 7.x format and E00 files to import the coverages into R variables.
Last updated 13 years ago
1.34 score 22 scripts 171 downloads