pachadotdev opened this issue Jun 1, 2024 · 97 comments

pachadotdev commented Jun 1, 2024

Submitting Author Name: Pacha
Submitting Author Github Handle: @pachadotdev
Version submitted: 0.2.5 (initially)
Submission type: Stats
Badge grade: silver (?)
Editor: @rkillick
Reviewers: TBD

Archive: TBD
Version accepted: TBD
Language: en

  • Paste the full DESCRIPTION file inside a code block below:
Package: capybara
Type: Package
Title: Fast and Memory Efficient Fitting of Linear Models With High-Dimensional
    Fixed Effects
Version: 0.5.2
Authors@R: c(
        given = "Mauricio",
        family = "Vargas Sepulveda",
        role = c("aut", "cre"),
        email = "[email protected]",
        comment = c(ORCID = "0000-0003-1017-7574"))
    testthat (>= 3.0.0),
Depends: R(>= 3.5.0)
Description: Fast and user-friendly estimation of generalized linear models with
    multiple fixed effects and cluster the standard errors. The method to obtain
    the estimated fixed-effects coefficients is based on Stammann (2018) 
    <> and Gaure (2013)
License: Apache License (>= 2)
LazyData: true
RoxygenNote: 7.3.1
Encoding: UTF-8
NeedsCompilation: yes
LinkingTo: cpp11, cpp11armadillo
VignetteBuilder: knitr
Config/testthat/edition: 3
Roxygen: list(markdown = TRUE, roclets = c("namespace", "rd", "srr::srr_stats_roclet"))


  • Please indicate which of our statistical package categories this package falls under. (Please check one or more appropriate boxes below):

    Statistical Packages

    • Bayesian and Monte Carlo Routines
    • Dimensionality Reduction, Clustering, and Unsupervised Learning
    • Machine Learning
    • Regression and Supervised Learning
    • Exploratory Data Analysis (EDA) and Summary Statistics
    • Spatial Analyses
    • Time Series Analyses
    • Probability Distributions

Pre-submission Inquiry

  • A pre-submission inquiry has been approved in issue#629

General Information

  • Who is the target audience and what are scientific applications of this package?

People (mostly) in the social sciences that need multiple controls in their models. This is especially useful in Economics and International Relations.


  • Have a demonstrated generality of usage beyond one single envisioned use case.
  • Compliance with a good number of standards beyond those identified as minimally necessary.

Technical checks

Confirm each of the following by checking the box.

This package:

Publication options

  • Do you intend for this package to go on CRAN?
  • Do you intend for this package to go on Bioconductor?

Code of conduct

Thanks for submitting to rOpenSci, our editors and @ropensci-review-bot will reply soon.

Editor check started


mpadge commented Jun 3, 2024

@pachadotdev Checks failed because of the missing comma in your DESCRIPTION file which I see you've fixed in your latest commit. Ask the bot to check package again and it should work.

@ropensci-review-bot check package

Thanks, about to send the query.

Editor check started


Editor-in-Chief Instructions:

Processing may not proceed until the items marked with ✖️ have been resolved.

jooolia commented Jun 5, 2024

Dear @pachadotdev , Thank you for your submission.

You have checked "Regression and Supervised Learning", which would entail a "Statistical software review" (a different issue template to be opened, but that is ok). Would you be willing to go through the statistical software review as described here and implement the standards:

Thanks, Julia

thanks a lot !

indeed, when I opened the issue that was the only available template on my end

I already implemented

@ropensci-review-bot check package

Thanks, about to send the query.

Editor check started


Editor-in-Chief Instructions:

Processing may not proceed until the items marked with ✖️ have been resolved.

@jooolia Hi, I cannot find the statistical review template.

jooolia commented Jun 29, 2024

Hi @pachadotdev , sorry for the delayed response. You have the option of several different types of templates when you open an issue; the statistical one is the last choice (the template is viewable here: ). Does this help? Thanks, Julia

hi @jooolia
I think the software is ready for review as is

jooolia commented Jul 26, 2024

Dear @pachadotdev , thanks for your patience. We are discussing with the statistical editors and will get back to you shortly.
Thanks, Julia

thanks! I uploaded some changes yesterday

jooolia commented Jul 29, 2024

Dear @pachadotdev, thanks for updating to address the statistical aspects of the package. Currently there are some srrstatsTODO tags in your srr-stats-standards.R file and we note that "software to be submitted for review must contain no @srrstatsTODO tags" (

Could you also edit your first comment in this issue so that it contains the comment here #645 (comment) (e.g., replace the content with the statistical issue content)?

Thanks, Julia

I updated the comment. I will need 3-4 weeks to finish all the srrstatsTODO

jooolia commented Jul 30, 2024

Ok thanks @pachadotdev for the update. Another editor will take over from me in a few days to continue helping with the process. Cheers, Julia

Hi, @pachadotdev, I'm taking over where @jooolia is leaving off as EIC. Please let me know what assistance you may need from me.

hi @adamhsparks
I think the pkg is ready for review

@ropensci-review-bot check package

Thanks, about to send the query.

Error: Issue template has no 'repourl'


@ropensci-review-bot check srr

'srr' standards compliance:

  • Complied with: 121 / 130 = 93.1% (general: 69 / 75; regression: 52 / 55)
  • Not complied with: 9 / 130 = 6.9% (general: 6 / 75; regression: 3 / 55)

✔️ This package complies with > 50% of all standards.

pachadotdev commented Nov 23, 2024

hi @adamhsparks
I took a different approach now, and proceeded to explain the code in the srr headers. Some explanations are too obvious but I did exactly what was requested. My approach was "not everybody has read Hansen's Econometrics, so it is not obvious how to pre-process the data"

adamhsparks commented Dec 1, 2024

Hi, @pachadotdev, my time as EIC ended at the beginning of last month. Thank you for all your work to get {Capybara} to this point.

Pinging @emilyriederer as the current acting EIC to action your submission.

hi @emilyriederer :)

Hi @pachadotdev ! Our general guidance is that authors should have one live review at a time to manage overall capacity. Do you want to move forward here and we'll put {redatam} on hold for now?

yes, Lital has not responded yet, so redatam should wait

emilyriederer mentioned this issue Dec 10, 2024

@ropensci-review-bot check srr

'srr' standards compliance:

  • Complied with: 121 / 130 = 93.1% (general: 69 / 75; regression: 52 / 55)
  • Not complied with: 9 / 130 = 6.9% (general: 6 / 75; regression: 3 / 55)

✔️ This package complies with > 50% of all standards.

@ropensci-review-bot check package

Thanks, about to send the query.

Editor check started


Checks for capybara (v0.7.0)

git hash: 5535f883

  • ✔️ Package name is available
  • ✔️ has a 'codemeta.json' file.
  • ✔️ has a 'contributing' file.
  • ✔️ uses 'roxygen2'.
  • ✔️ 'DESCRIPTION' has a URL field.
  • ✔️ 'DESCRIPTION' has a BugReports field.
  • ✔️ Package has at least one HTML vignette
  • ✔️ All functions have examples.
  • ✔️ Package has continuous integration checks.
  • ✔️ Package coverage is 80.4%.
  • ✔️ R CMD check found no errors.
  • ✔️ R CMD check found no warnings.
  • 👀 Function names are duplicated in other packages

(Checks marked with 👀 may be optionally addressed.)

Package License: Apache License (>= 2)

1. rOpenSci Statistical Standards (srr package)

This package is in the following category:

  • Regression and Supervised Learning

✔️ Your code includes the following standards which are not actual standards: [G2.1b], [RE5.1], [RE5.2], [RE5.3], [G3.1b], [G3.3], [G3.1c], [G3.4a], [G3.2], [RE5.4], [RE8.1], [RE5.5]

Click to see the report of author-reported standards compliance of the package with links to associated lines of code, which can be re-generated locally by running the srr_report() function from within a local clone of the repository.

2. Package Dependencies

Details of Package Dependency Usage (click to open)

The table below tallies all function calls to all packages ('ncalls'), both internal (r-base + recommended, along with the package itself), and external (imported and suggested packages). 'NA' values indicate packages to which no identified calls to R functions could be found. Note that these results are generated by an automated code-tagging system which may not be entirely accurate.

type package ncalls
internal base 199
internal capybara 117
internal utils 49
internal grDevices 13
internal graphics 2
imports stats 95
imports magrittr 6
imports dplyr 4
imports MASS 4
imports ggplot2 2
imports Formula 1
imports kendallknight 1
imports rlang 1
suggests broom NA
suggests knitr NA
suggests rmarkdown NA
suggests testthat NA
suggests tidyr NA
suggests units NA
linking_to cpp11 NA
linking_to cpp11armadillo NA

Click below for tallies of functions used in each package. Locations of each call within this package may be generated locally by running 's <- pkgstats::pkgstats(<path/to/repo>)', and examining the 'external_calls' table.


base: for (12), list (10), drop (9), length (9), names (9), attr (8), nrow (8), abs (7), c (7), (7), mapply (6), matrix (6), max (6), beta (5), ncol (5), try (5), solve (4), sum (4), as.logical (3), cbind (3), class (3), data.frame (3), diag (3), getOption (3), lapply (3), rep (3), sqrt (3), structure (3), summary (3), trace (3), with (3), all (2), apply (2), integer (2), levels (2), nchar (2), Reduce (2), rownames (2), seq_len (2), vapply (2), (1), as.list (1), as.numeric (1), as.vector (1), colnames (1), colSums (1), crossprod (1), gamma (1), inherits (1), mean (1), order (1), paste0 (1), suppressWarnings (1), unlist (1), unname (1)

capybara: get_index_list_ (8), feglm_fit_ (3), fixed_effects (3), group_sums_ (3), nobs_ (3), partial_mu_eta_ (3), apes_bias_correction_ (2), apes_set_adj_ (2), bias_corr (2), center_variables_r_ (2), col_types (2), feglm (2), gamma_ (2), get_alpha_ (2), get_score_matrix_ (2), group_sums_cov_ (2), init_theta_ (2), summary_estimates_max_width_ (2), temp_var_ (2), apes (1), apes_adjust_covariance_ (1), apes_bias_check_binary_model_ (1), apes_bias_check_object_ (1), apes_bias_check_panel_ (1), augment.feglm (1), augment.felm (1), autoplot.feglm (1), autoplot.felm (1), bias_corr_check_fixed_effects_ (1), check_control_ (1), check_data_ (1), check_factor_ (1), check_family_ (1), check_formula_ (1), check_linear_dependence_ (1), check_response_ (1), check_weights_ (1), coef.apes (1), coef.feglm (1), coef.felm (1), coef.summary.apes (1), coef.summary.feglm (1), coef.summary.felm (1), confint.feglm (1), confint.felm (1), drop_by_link_type_ (1), feglm_control (1), feglm_offset_ (1), feglm_offset_fit_ (1), felm (1), fenegbin (1), fenegbin_check_convergence_ (1), fenegbin_result_list_ (1), fepoisson (1), fitted.feglm (1), fitted.felm (1), glance.feglm (1), glance.felm (1), group_sums_spectral_ (1), group_sums_var_ (1), model_frame_ (1), model_response_ (1), predict.feglm (1), predict.felm (1), print.apes (1), print.feglm (1), print.felm (1), print.summary.apes (1), print.summary.feglm (1), print.summary.felm (1), second_order_derivative_ (1), start_guesses_ (1), summary_estimates_ (1), summary_estimates_cols_ (1), summary_estimates_dashes_ (1), summary_estimates_header_ (1), summary_estimates_print_rows_ (1), summary_estimates_signif_ (1), summary_family_ (1), summary_fisher_ (1), summary_formula_ (1), summary.apes (1), summary.feglm (1), summary.felm (1), vcov_feglm_cluster_data_ (1), vcov_feglm_hessian_covariance_ (1), vcov_feglm_vars_ (1)

stats: family (30), formula (23), nobs (12), model.matrix (10), terms (5), deviance (3), pnorm (3), vcov (3), qnorm (2), weights (2), offset (1), poisson (1)

utils: data (48), combn (1)

grDevices: cm (13)

magrittr: %>% (6)

dplyr: all_of (2), select (2)

MASS: negative.binomial (2), (2)

ggplot2: aes (1), ggplot (1)

graphics: pie (2)

Formula: Formula (1)

kendallknight: kendall_cor (1)

rlang: sym (1)

3. Statistical Properties

This package features some noteworthy statistical properties which may need to be clarified by a handling editor prior to progressing.

Details of statistical properties (click to open)

The package has:

  • code in C++ (29% in 7 files) and R (71% in 25 files)
  • 1 authors
  • 1 vignette
  • 1 internal data file
  • 9 imported packages
  • 18 exported functions (median 13 lines of code)
  • 192 non-exported functions in R (median 8 lines of code)
  • 51 C++ functions (median 5 lines of code)

Statistical properties of package structure as distributional percentiles in relation to all current CRAN packages
The following terminology is used:

  • loc = "Lines of Code"
  • fn = "function"
  • exp/not_exp = exported / not exported

All parameters are explained as tooltips in the locally-rendered HTML version of this report generated by the checks_to_markdown() function

The final measure (fn_call_network_size) is the total number of calls between functions (in R), or more abstract relationships between code objects in other languages. Values are flagged as "noteworthy" when they lie in the upper or lower 5th percentile.

measure value percentile noteworthy
files_R 25 85.3
files_src 7 93.6
files_vignettes 2 81.7
files_tests 14 91.6
loc_R 1633 77.6
loc_src 664 56.9
loc_vignettes 81 18.4
loc_tests 601 75.0
num_vignettes 1 58.8
data_size_total 285103 88.6
data_size_median 285103 95.4 TRUE
n_fns_r 210 89.1
n_fns_r_exported 18 63.8
n_fns_r_not_exported 192 91.8
n_fns_src 51 74.6
n_fns_per_file_r 5 67.4
n_fns_per_file_src 7 71.4
num_params_per_fn 3 29.4
loc_per_fn_r 8 24.5
loc_per_fn_r_exp 14 33.3
loc_per_fn_r_not_exp 8 23.0
loc_per_fn_src 5 11.5
rel_whitespace_R 21 80.5
rel_whitespace_src 31 71.5
rel_whitespace_vignettes 23 11.5
rel_whitespace_tests 31 81.6
doclines_per_fn_exp 53 65.9
doclines_per_fn_not_exp 0 0.0 TRUE
fn_call_network_size 194 87.5

3a. Network visualisation

Click to see the interactive network visualisation of calls between objects in package

4. goodpractice and other checks

Details of goodpractice checks (click to open)

3a. Continuous Integration Badges


GitHub Workflow Results

id name conclusion sha run_number date
11989684165 format_check success 5535f8 79 2024-11-23
11989684069 pages build and deployment success 5535f8 97 2024-11-23
11989684163 R-CMD-check success 5535f8 99 2024-11-23
11989684168 test-coverage success 5535f8 78 2024-11-23

3b. goodpractice results

R CMD check with rcmdcheck

R CMD check generated the following note:

  1. checking installed package size ... NOTE
    installed size is 13.2Mb
    sub-directories of 1Mb or more:
    libs 12.4Mb

R CMD check generated the following check_fail:

  1. rcmdcheck_reasonable_installed_size

Test coverage with covr

Package coverage: 80.44

Cyclocomplexity with cyclocomp

No functions have cyclocomplexity >= 15

Static code analyses with lintr

lintr found no issues with this package!

5. Other Checks

Details of other checks (click to open)

✖️ The following 4 function names are duplicated in other packages:

    • bias_corr from bife
    • feglm from alpaca
    • felm from lfe
    • fixed_effects from baggr, gratia, gravity

Package Versions

package version

Editor-in-Chief Instructions:

This package is in top shape and may be passed on to a handling editor

@ropensci-review-bot assign @rkillick as editor

@ropensci-review-bot help

@ropensci-review-bot help

@ropensci-review-bot assign @rkillick as editor

Assigned! @rkillick is now the editor

Dear @pachadotdev and @rkillick just to let you know that my EiC rotation just started and I'm checking all open issues.

Looks like the package passes all checks cleanly 💯

@rkillick are you still able to handle this submission?

rkillick commented Feb 2, 2025

@maurolepore Yes, I've been trying to find reviewers.

Great, thanks so much! I'll step back.

hi @rkillick, can I send the package to CRAN in the meanwhile?

@pachadotdev while I'm here I'll share the relevant guideline:

We strongly suggest submitting your package for review before publishing on CRAN or submitting a software paper describing the package to a journal. Review feedback may result in major improvements and updates to your package, including renaming and breaking changes to functions.

So I interpret this as we don't encourage it but also we won't stop you.

thx, I will wait then :)

