hdcuremodels #690

kelliejarcher · 2025-02-26T19:15:08Z

Submitting Author Name: Kellie J. Archer
Submitting Author Github Handle: @kelliejarcher
Repository: https://github.com/kelliejarcher/hdcuremodels
Submission type: Pre-submission
Language: en

Paste the full DESCRIPTION file inside a code block below:

Package: hdcuremodels
Title: Penalized Mixture Cure Models for High-Dimensional Data
Version: 0.0.2
Date: 2025-02-26
Authors@R: 
    c(person("Han", "Fu", role = "aut"), person(c("Kellie J."), "Archer", email=
    "[email protected]", role = c("aut","cre"), comment = c(ORCID="0000-0003-1555-5781")))
Description: Provides functions for fitting various penalized parametric and semi-parametric mixture cure models with different penalty functions, testing for a significant cure fraction, and testing for sufficient follow-up as described in Fu et al (2022)<doi:10.1002/sim.9513> and Archer et al (2024)<doi:10.1186/s13045-024-01553-6>. False discovery rate controlled variable selection is provided using model-X knock-offs. 
License: MIT + file LICENSE
Encoding: UTF-8
Depends: R (>= 4.2.0)
Imports: doParallel,
         flexsurv,
         flexsurvcure,
         foreach,
         ggplot2,
         ggpubr,
         glmnet,
         knockoff,
         mvnfast,
         parallel,
         plyr,
         methods,
         survival
Roxygen: list(markdown = TRUE, roclets = c ("namespace", "rd", "srr::srr_stats_roclet"))
RoxygenNote: 7.3.2
Suggests: 
    knitr,
    rmarkdown,
    roxygen2
VignetteBuilder: knitr
LazyData: true

Scope

Please indicate which category or categories from our package fit policies or statistical package categories this package falls under. (Please check one or more appropriate boxes below):

Data Lifecycle Packages
- data retrieval
- data extraction
- data munging
- data deposition
- data validation and testing
- workflow automation
- version control
- citation management and bibliometrics
- scientific software wrappers
- field and lab reproducibility tools
- database software bindings
- geospatial data
- text analysis
Statistical Packages
- Bayesian and Monte Carlo Routines
- Dimensionality Reduction, Clustering, and Unsupervised Learning
- Machine Learning
- [X ] Regression and Supervised Learning
- Exploratory Data Analysis (EDA) and Summary Statistics
- Spatial Analyses
- Time Series Analyses
- Probability Distributions
Explain how and why the package falls under these categories (briefly, 1-2 sentences). Please note any areas you are unsure of:
If submitting a statistical package, have you already incorporated documentation of standards into your code via the srr package?

Yes, I tried to add notations via the srr package. This is all very new so advice would be most welcome.

Who is the target audience and what are scientific applications of this package?

Analysts interested in modeling a time-to-event outcome when a subset of patients experience long-term survival or cure. This package permits fitting penalized mixture cure models so the functions can handle modeling the time-to-event outcome when the covariate/predictor space is high-dimensional.

Are there other R packages that accomplish the same thing? If so, how does yours differ or meet our criteria for best-in-category?

None of the existing packages that fit mixture cure models (MCMs) are capable of handling high-dimensional datasets. Only penPHcure includes a LASSO penalty to perform variable selection for scenarios when the sample size exceeds the number of predictors. Other R packages that can be used for fitting MCMs include:

cuRe (Jakobsen, 2023) can be used to fit parametric MCMs on a relative survival scale;
CureDepCens (Schneider and Grandemagne dos Santos, 2023) can be used to fit piecewise exponential or Weibull model with dependent censoring;
curephEM (Hou and Ren, 2024) can be used to fit a MCM where the latency is modeled using a Cox PH model;
flexsurvcure (Amdahl, 2022) can be used to fit parametric mixture and non-mixture cure models;
geecure (Niu and Peng, 2018) can be used to fit marginal MCM for clustered survival data;
GORCure (Zhou et al, 2017) can be used to fit generalized odds rate MCM with interval censored data;
mixcure (Peng, 2020) can be used to fit non-parametric, parametric, and semiparametric MCMs;
npcure (López-de-Ullibarri and López-Cheda, 2020) can be used to non-parametrically estimate incidence and latency;
npcurePK (Safari et al, 2023) can be used to non-parametrically estimate incidence and latency when cure is partially observed;
penPHcure(Beretta and Heuchenne, 2019) can be used to fit semi-parametric PH MCMs with time-varying covariates; and
smcure (Cai et al 2022) can be used to fit semi-parametric (PH and AFT) MCMs.

(If applicable) Does your package comply with our guidance around Ethics, Data Privacy and Human Subjects Research?

Not applicable.

Any other questions or issues we should be aware of?:

I am very inexperienced using GitHub and have not used it for collaboration before. I did post a version of this package on CRAN last June and more recently learned about ROpenSci so the github version is my initial attempt to adhere to your standards. I have not submitted a peer-reviewed manuscript yet as I would prefer to have an ROpenSci review first. Also, when I ran pkgcheck and tried to look at the summary I received a message that read, "Error: No GNU global installation found." and was unclear how to proceed.

The text was updated successfully, but these errors were encountered:

mpadge · 2025-02-27T11:07:40Z

Hi @kelliejarcher, and thank you for your pre-submission. No worries about inexperience - rOpenSci strives to be as welcoming and inclusive as possible, and to always be answer any questions you might have, and to help you along the way. As this is a pre-submission inquiry, feel free to ask questions here, or alternatively open issues in your own repository, cross-link them here by pasting the url for this issue in the comment, and ping me there. (Note that we try to keep the full submissions as "clean" as possible to help focus on reviews, while pre-submissions are the place for more general questions and dicussions.)

Specific responses to your questions:

The "Error: No GNU global installation found" is because {pkgcheck}, and the {pkgstats} package it uses to analyses packages, require a couple of system libraries, including "GNU global". If you're on a Linux-based or MacOS, installation is simply, generally by using standard package manager (apt-get, homebrew, or whatever), to install global. If you're on Windows, it's tricker, but you could start with the links in the {pkgstats} installation vignette.
The {srr} question sounds more general, and is maybe best moved to a specific issue within your repo? If you ping me there, I'll happily help further.

More generally, your package definitely looks like a good fit for statistical software review, and definitely within the category you've already indicated. Looking forward to working towards a full submission!

kelliejarcher · 2025-02-27T20:22:43Z

Hi Mark, I used homebrew to install global and it is in my search path, as verified by global --version global (GNU Global) 6.6.14 Powered by Berkeley DB 1.85 and SQLite3 3.49.1. Copyright (c) 1996-2024 Tama Communications Corporation License GPLv3+: GNU GPL version 3 or later http://www.gnu.org/licenses/gpl.html This is free software; you are free to change and redistribute it. There is NO WARRANTY, to the extent permitted by law. I also restarted my machine but am still getting

x <- pkgcheck (mydir)

Error: No GNU global installation found. I also have ctags installed: ctags --version Universal Ctags 6.1.0, Copyright (C) 2015-2023 Universal Ctags Team Universal Ctags is derived from Exuberant Ctags. Exuberant Ctags 5.8, Copyright (C) 1996-2009 Darren Hiebert Compiled: Feb 17 2025, 07:54:50 URL: https://ctags.io/ Output version: 0.0 Optional compiled features: +wildcards, +regex, +gnulib_fnmatch, +gnulib_regex, +iconv, +option-directory, +xpath, +json, +interactive, +yaml, +case-insensitive-filenames, +packcc, +optscript, +pcre2 But in R when I run ctags_test() I get Error: No GNU global installation found. Do you know why this doesn’t work? Also, would you please provide more specific guidance about what you mean by “or alternatively open issues in your own repository, cross-link them here by pasting the url for this issue in the comment, and ping me there.”? Does that mean that under the repository, I click on <Issues>, click on <New issue> and then type whatever text and then send a link? If so, do you want individual issues entered separately or can you have one long laundry list of issues? Sorry for all the questions. Best regards, Kellie From: mark padgham ***@***.***> Date: Thursday, February 27, 2025 at 6:08 AM To: ropensci/software-review ***@***.***> Cc: Archer, Kellie ***@***.***>, Mention ***@***.***> Subject: Re: [ropensci/software-review] hdcuremodels (Issue #690) Hi @kelliejarcher, and thank you for your pre-submission. No worries about inexperience - rOpenSci strives to be as welcoming and inclusive as possible, and to always be answer any questions you might have, and to help you along the way. As Hi @kelliejarcher<https://urldefense.com/v3/__https:/github.com/kelliejarcher__;!!KGKeukY!1cCR1ET6HXFR0W8HksqghUebuvDfxPa2AmphB0DMkwS6B8XR4aTFPY3kwAa-MmzbsMJbCBpvou9AHHRYSIwa0Syi9g$>, and thank you for your pre-submission. No worries about inexperience - rOpenSci strives to be as welcoming and inclusive as possible, and to always be answer any questions you might have, and to help you along the way. As this is a pre-submission inquiry, feel free to ask questions here, or alternatively open issues in your own repository, cross-link them here by pasting the url for this issue in the comment, and ping me there. (Note that we try to keep the full submissions as "clean" as possible to help focus on reviews, while pre-submissions are the place for more general questions and dicussions.) Specific responses to your questions: * The "Error: No GNU global installation found" is because {pkgcheck}, and the {pkgstats} package it uses to analyses packages, require a couple of system libraries, including "GNU global". If you're on a Linux-based or MacOS, installation is simply, generally by using standard package manager (apt-get, homebrew, or whatever), to install global. If you're on Windows, it's tricker, but you could start with the links in the {pkgstats} installation vignette<https://urldefense.com/v3/__https:/docs.ropensci.org/pkgstats/articles/installation.html__;!!KGKeukY!1cCR1ET6HXFR0W8HksqghUebuvDfxPa2AmphB0DMkwS6B8XR4aTFPY3kwAa-MmzbsMJbCBpvou9AHHRYSIzKa2EYYA$>. * The {srr} question sounds more general, and is maybe best moved to a specific issue within your repo? If you ping me there, I'll happily help further. More generally, your package definitely looks like a good fit for statistical software review, and definitely within the category you've already indicated. Looking forward to working towards a full submission! — Reply to this email directly, view it on GitHub<https://urldefense.com/v3/__https:/github.com/ropensci/software-review/issues/690*issuecomment-2687629762__;Iw!!KGKeukY!1cCR1ET6HXFR0W8HksqghUebuvDfxPa2AmphB0DMkwS6B8XR4aTFPY3kwAa-MmzbsMJbCBpvou9AHHRYSIy8df-rZA$>, or unsubscribe<https://urldefense.com/v3/__https:/github.com/notifications/unsubscribe-auth/AUKATYSOBKTX6VIN767FLMT2R3WZDAVCNFSM6AAAAABX6AKLP6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDMOBXGYZDSNZWGI__;!!KGKeukY!1cCR1ET6HXFR0W8HksqghUebuvDfxPa2AmphB0DMkwS6B8XR4aTFPY3kwAa-MmzbsMJbCBpvou9AHHRYSIxpDg_ANA$>. You are receiving this because you were mentioned.Message ID: ***@***.***> [Image removed by sender. mpadge]mpadge left a comment (ropensci/software-review#690)<https://urldefense.com/v3/__https:/github.com/ropensci/software-review/issues/690*issuecomment-2687629762__;Iw!!KGKeukY!1cCR1ET6HXFR0W8HksqghUebuvDfxPa2AmphB0DMkwS6B8XR4aTFPY3kwAa-MmzbsMJbCBpvou9AHHRYSIy8df-rZA$> Hi @kelliejarcher<https://urldefense.com/v3/__https:/github.com/kelliejarcher__;!!KGKeukY!1cCR1ET6HXFR0W8HksqghUebuvDfxPa2AmphB0DMkwS6B8XR4aTFPY3kwAa-MmzbsMJbCBpvou9AHHRYSIwa0Syi9g$>, and thank you for your pre-submission. No worries about inexperience - rOpenSci strives to be as welcoming and inclusive as possible, and to always be answer any questions you might have, and to help you along the way. As this is a pre-submission inquiry, feel free to ask questions here, or alternatively open issues in your own repository, cross-link them here by pasting the url for this issue in the comment, and ping me there. (Note that we try to keep the full submissions as "clean" as possible to help focus on reviews, while pre-submissions are the place for more general questions and dicussions.) Specific responses to your questions: * The "Error: No GNU global installation found" is because {pkgcheck}, and the {pkgstats} package it uses to analyses packages, require a couple of system libraries, including "GNU global". If you're on a Linux-based or MacOS, installation is simply, generally by using standard package manager (apt-get, homebrew, or whatever), to install global. If you're on Windows, it's tricker, but you could start with the links in the {pkgstats} installation vignette<https://urldefense.com/v3/__https:/docs.ropensci.org/pkgstats/articles/installation.html__;!!KGKeukY!1cCR1ET6HXFR0W8HksqghUebuvDfxPa2AmphB0DMkwS6B8XR4aTFPY3kwAa-MmzbsMJbCBpvou9AHHRYSIzKa2EYYA$>. * The {srr} question sounds more general, and is maybe best moved to a specific issue within your repo? If you ping me there, I'll happily help further. More generally, your package definitely looks like a good fit for statistical software review, and definitely within the category you've already indicated. Looking forward to working towards a full submission! — Reply to this email directly, view it on GitHub<https://urldefense.com/v3/__https:/github.com/ropensci/software-review/issues/690*issuecomment-2687629762__;Iw!!KGKeukY!1cCR1ET6HXFR0W8HksqghUebuvDfxPa2AmphB0DMkwS6B8XR4aTFPY3kwAa-MmzbsMJbCBpvou9AHHRYSIy8df-rZA$>, or unsubscribe<https://urldefense.com/v3/__https:/github.com/notifications/unsubscribe-auth/AUKATYSOBKTX6VIN767FLMT2R3WZDAVCNFSM6AAAAABX6AKLP6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDMOBXGYZDSNZWGI__;!!KGKeukY!1cCR1ET6HXFR0W8HksqghUebuvDfxPa2AmphB0DMkwS6B8XR4aTFPY3kwAa-MmzbsMJbCBpvou9AHHRYSIxpDg_ANA$>. You are receiving this because you were mentioned.Message ID: ***@***.***>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hdcuremodels #690

hdcuremodels #690

kelliejarcher commented Feb 26, 2025

mpadge commented Feb 27, 2025

kelliejarcher commented Feb 27, 2025 via email

hdcuremodels #690

hdcuremodels #690

Comments

kelliejarcher commented Feb 26, 2025

Scope

mpadge commented Feb 27, 2025

kelliejarcher commented Feb 27, 2025 via email