pointblank - Data Validation and Organization of Metadata for Local and Remote Tables
Validate data in data frames, 'tibble' objects, 'Spark' 'DataFrames', and database tables. Validation pipelines can be made using easily-readable, consecutive validation steps. Upon execution of the validation plan, several reporting options are available. User-defined thresholds for failure rates allow for the determination of appropriate reporting actions. Many other workflows are available including an information management workflow, where the aim is to record, collect, and generate useful information on data tables.
Last updated
data-assertionsdata-checkerdata-dictionariesdata-framesdata-inferencedata-managementdata-profilerdata-qualitydata-validationdata-verificationdatabase-tableseasy-to-understandreporting-toolschema-validationtesting-toolsyaml-configuration
11.52 score 1.0k stars 2 dependents 537 scripts 12k downloads
openalexR - Getting Bibliographic Records from 'OpenAlex' Database Using 'DSL' API
A set of tools to extract bibliographic content from 'OpenAlex' database using API <https://docs.openalex.org>.
Last updated
bibliographic-databibliographic-databasebibliometricsbibliometrixscience-mapping
10.81 score 134 stars 7 dependents 330 scripts 19k downloads
ggtrace - Programmatically explore, debug, and manipulate ggplot internals
Programmatically explore, debug, and manipulate ggplot internals. Package ggtrace offers a low-level interface that extends base R capabilities of trace, as well as a family of workflow functions that make interactions with ggplot internals more accessible.
Last updated
debuggingggplot2metaprogramming
6.76 score 61 stars 63 scripts 278 downloads
jlmerclusterperm - Cluster-Based Permutation Analysis for Densely Sampled Time Data
An implementation of fast cluster-based permutation analysis (CPA) for densely-sampled time data developed in Maris & Oostenveld, 2007 <doi:10.1016/j.jneumeth.2007.03.024>. Supports (generalized, mixed-effects) regression models for the calculation of timewise statistics. Provides both a wholesale and a piecemeal interface to the CPA procedure with an emphasis on interpretability and diagnostics. Integrates 'Julia' libraries 'MixedModels.jl' and 'GLM.jl' for performance improvements, with additional functionalities for interfacing with 'Julia' from 'R' powered by the 'JuliaConnectoR' package.
Last updated
cluster-based-permutation-testeegeyetrackingmixed-effects-modelstimeseries
4.99 score 14 stars 14 scripts 643 downloadsJuliaFormulae - Translate R Regression Model Formulae to 'Julia' Syntax
Metaprogramming utilities for converting R regression model formulae to equivalents in 'Julia' <doi:10.1137/141000671>, via modifications to the abstract syntax tree. Supports translations in zero correlation random effects syntax, protection of expressions to be evaluated as-is, interaction terms, and more. Accepts strings or R formula objects and returns modified R formula objects where possible (or a modified string, if not a valid formula in R).
Last updated
3.48 score 2 dependents 3 scripts 199 downloadsggcolormeter - Colormeter Guide Extension
A 'ggplot2' guide extension for fill and color scales in the style of a dashboard meter. The dashboard legend maps onto continuous aesthetics and can be customized for its dimensions and the style of its various components including the labels and frames. Fine-grained control over the positioning of dashboard components is possible via an option to expose the legend-internal coordinate system.
Last updated
ggplot2
2.98 score 19 stars 2 scriptsjlme - Regression Modelling with 'GLM.jl' and 'MixedModels.jl' in 'Julia'
Bindings to 'Julia' packages 'GLM.jl' <doi:10.5281/zenodo.3376013> and 'MixedModels.jl' <doi:10.5281/zenodo.12575371>, powered by 'JuliaConnectoR'. Fits (generalized) linear (mixed-effects) regression models in 'Julia' using familiar model fitting syntax from R. Offers 'broom'-style data frame summary functionalities for 'Julia' regression models.
Last updated
1.70 score 1 stars 6 scripts 12 downloads