Fix discovered bugs #244

rix133 · 2025-10-14T14:07:43Z

This pull request introduces several improvements and optimizations to the RDBEScore package, focusing on estimation object creation, memory management, and documentation. The main changes include more efficient handling of sub-sampling relationships, an option to exclude design variables for smaller objects, improved joining logic, and the addition of a GitHub Actions workflow for pkgdown site deployment.

Estimation object creation and memory management:

- Added an `incDesignVariables` parameter to `createRDBESEstObject()` to allow users to exclude design variables, reducing object size. Character columns are now converted to factors for further memory optimization.
- Improved handling of sub-sampling relationships in the SA table by replacing recursive logic with a more efficient self-join and lookup table (`prepareSubSampleLevelLookup`), including warnings for missing or non-unique matches.
- Added frequent calls to `gc()` (garbage collection) throughout the estimation object creation and joining functions to reduce memory usage during large data processing.
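
As a rough illustration of the self-join idea described above, the following base R sketch builds a parent lookup for a toy SA table and then assigns sub-sample levels with a loop instead of recursion. It is not the package's `prepareSubSampleLevelLookup`: the helper name `buildSubSampleLookup`, the toy values, and the warning texts are assumptions made for this example. Only the column names (`SAid`, `SSid`, `SAseqNum`, `SAparSeqNum`) and the idea of matching on both `SSid` and `SAparSeqNum` come from this pull request.

```r
# Toy SA table: column names follow the RDBES SA table, values are made up.
sa <- data.frame(
  SAid        = c(1, 2, 3, 4),
  SSid        = c(10, 10, 10, 11),
  SAseqNum    = c(1, 2, 3, 1),
  SAparSeqNum = c(NA, 1, 2, NA)   # NA marks a top-level sample
)

# Hypothetical helper, for illustration only.
buildSubSampleLookup <- function(sa) {
  # Parent side of the self-join: a parent's SAseqNum is what a child
  # stores in SAparSeqNum, so rename it to join on the same key.
  parents <- data.frame(
    SSid        = sa$SSid,
    SAparSeqNum = sa$SAseqNum,
    parentSAid  = sa$SAid
  )

  # Self-join on SSid and SAparSeqNum, keeping every child row.
  lookup <- merge(
    sa[, c("SAid", "SSid", "SAparSeqNum")], parents,
    by = c("SSid", "SAparSeqNum"), all.x = TRUE
  )
  lookup <- lookup[order(lookup$SAid), ]

  # Warn about sub-samples whose parent is missing or ambiguous.
  orphan <- !is.na(lookup$SAparSeqNum) & is.na(lookup$parentSAid)
  if (any(orphan)) {
    warning("Some sub-samples have no matching parent sample")
  }
  if (anyDuplicated(lookup$SAid) > 0) {
    warning("Some sub-samples match more than one parent sample")
  }

  # Assign levels iteratively: top-level samples are level 1, and each
  # pass fills in the children of samples whose level is already known.
  lookup$level <- ifelse(is.na(lookup$SAparSeqNum), 1L, NA_integer_)
  repeat {
    parentLevel <- lookup$level[match(lookup$parentSAid, lookup$SAid)]
    fill <- is.na(lookup$level) & !is.na(parentLevel)
    if (!any(fill)) break
    lookup$level[fill] <- parentLevel[fill] + 1L
  }
  lookup
}

lookup <- buildSubSampleLookup(sa)
```

In this toy data, sample 2 is a sub-sample of sample 1 and sample 3 is a sub-sample of sample 2, so they end up at levels 2 and 3, while samples 1 and 4 stay at level 1. Each pass of the loop resolves one further level of nesting, so the number of passes is bounded by the depth of the sub-sampling rather than by the number of rows.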

Joining logic and table handling:

- Improved join field selection for hierarchy 7 in `procRDBESEstObjUppHier()` and added comments about data.table vs. dplyr joins for future optimization.
- Updated logic for selecting `VDid` fields to avoid confusion when multiple fields are present.

Documentation and workflow:

- Added a GitHub Actions workflow (`.github/workflows/pkgdown.yaml`) for automated pkgdown site building and deployment to GitHub Pages.
- Updated `.Rbuildignore` to exclude pkgdown config and output files.
- Improved function documentation for clarity, including return value descriptions and parameter updates.

Miscellaneous:

- Updated URLs in the `DESCRIPTION` file to include the pkgdown site.
- Optimized filtering logic in `filterRDBESDataObject()` to use data.table for efficiency.

Copilot

Pull Request Overview

This pull request standardizes vignette titles, improves cross-referencing between vignettes, fixes spelling errors, and removes obsolete test data directories. The changes focus primarily on documentation consistency and test data cleanup.

Key Changes:

- Standardized vignette titles with numeric prefixes (e.g., "01a", "02b") for better organization
- Added dynamic cross-referencing sections to all vignettes listing related documentation
- Fixed spelling errors ("prevenes" → "prevents")
- Removed two obsolete H7 test data directories (v1.19.18 and v1.19.26)
- Enhanced test coverage for validation rules, import functionality, and estimation object creation

Reviewed Changes

Copilot reviewed 88 out of 125 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| `vignettes/*.Rmd` (8 files) | Updated titles with numeric prefixes and added dynamic vignette cross-reference sections |
| `tests/testthat/test-validateRDBESDataObject.R` | Added tests for SL/IS validation rules |
| `tests/testthat/test-importRDBESDataZIP.R` | New test file for ZIP import functionality |
| `tests/testthat/test-importRDBESDataCSV.R` | New test file for CSV import functionality |
| `tests/testthat/test-createRDBESEstObject.R` | Added SSid assignments for parent-child relationships |
| `tests/testthat/test-createRDBESDataObject.R` | Updated tests to expect clear error messages for malformed data |
| `tests/testthat/h7_v_1_19_26/*` (11 files) | Removed obsolete H7 v1.19.26 test data directory |
| `tests/testthat/h7_v_1_19_18/*` (10 files) | Removed obsolete H7 v1.19.18 test data directory |
| `man/upperTblData.Rd` | Removed unnecessary blank line in examples section |

Copilot · 2025-10-15T13:36:53Z

vignettes/v02c-Generating-NAs-for-species-not-targeted-by-sampling.Rmd

 The difference between these two cases can be specified via the argument `overwriteSampled` in the function `generateNAsUsingSL`. By default (estimation case) the argument is set to TRUE which makes `generateNAsUsingSL` set the weights of these extra species to NA. But, by explicitly setting that argument as `overwriteSampled=FALSE` the information collected can also kept.

-To demonstrate this we carry out a small alteration of the example data, removing the *Nephrops norvegicus* from the Species List. This creates a somewhat  atypical situation (it configures a case where of a haul where nothing was supposed to be looked for but still *Nephrops norvegicus* was registered) that is used here for sake of simplifying the example.
+To demonstrate this we carry out a small alteration of the example data, removing the *Nephrops norvegicus* from the Species List. This creates a somewhat atypical situation: a haul where nothing was supposed to be looked for but still *Nephrops norvegicus* was registered. We use such situation here for sake of a simple example.


The phrasing 'for sake of a simple example' should be 'for the sake of a simple example' to be grammatically correct.

Suggested change

-To demonstrate this we carry out a small alteration of the example data, removing the *Nephrops norvegicus* from the Species List. This creates a somewhat atypical situation: a haul where nothing was supposed to be looked for but still *Nephrops norvegicus* was registered. We use such situation here for sake of a simple example.
+To demonstrate this we carry out a small alteration of the example data, removing the *Nephrops norvegicus* from the Species List. This creates a somewhat atypical situation: a haul where nothing was supposed to be looked for but still *Nephrops norvegicus* was registered. We use such situation here for the sake of a simple example.

Merge pull request #244 from ices-tools-dev/dev

davidcurrie2001 and others added 30 commits April 14, 2025 14:29

Fix missing hash in example code

9ad8a63

Use package name in example

686f6cf

Merge branch 'dev' of https://github.com/ices-tools-dev/RDBEScore into dev # Conflicts: # R/lowerTblData.R # R/upperTblData.R # man/lowerTblData.Rd # man/upperTblData.Rd

Bring in line with GitHub

d81ff0a

temporary fix

60dbd84

- fixed issue with LE table in hierarchies 7 and 9 - added gc() in some points but code optimalization needed. - #issue 155

Update importRDBESDataCSV.R

fb09de3

#229

optimization, less dplyr in joining

cea8ae3

#155

Revert "optimization, less dplyr in joining"

5be9f3d

This reverts commit cea8ae3.

Merge pull request #231 from ices-tools-dev/patch/upload-artifact-v4

38cb277

Fix test coverage works

Merge pull request #232 from ices-tools-dev/main

5ce80bf

Fix test version

Merge branch 'dev' of https://github.com/ices-tools-dev/RDBEScore int…

cbca310

…o dev

Function runs much quicker (sped up the code which handles sub-sampling)

8f26b55

Changed a join to use data.table rather than dplyr (#233)

d50ba6f

Don't just match sub-samples using SAparSeqNum - also use SSid. Some …

d762b39

…code rewritten to use data.table rather than dplyr (#233)

Ensure SSid also matches between sample and sub-sample in test data

594772c

Use factors in RDBESEstObject to reduce size. Fix column order bug in…

d7c396c

… updated code (#233).

data.table treats duplicate columns differently than dplyr - handle t…

a573227

…his for VDid (#233)

Actually fix the VDid code this time (#233)

a2238dc

Changed order of SA and LE in H9

e1f7997

Added in a new incDesignVariables parameter to control whether the de…

518d02b

…sign variables get included in the estimation object output (#233)

Revert back to using dplyr - data.table was giving the wrong result (#…

d6547ba

…233)

Minor change

adf7fc1

vignette renamed

35ca136

more renaming of vignettes

ef21d56

renamed vignettes

77f62c8

renamed vignettes

fix test data to latest format

6a19b2f

error with incosistent colum count input csv file

12b6872

update H7 dataset to latest format

69ff587

update rendered documentation

fdb18ee

new file for zip tests

1011e05

updated "see other packages" section

6337208

"see other packages" section added with all other vignettes #238

rix133 and others added 23 commits October 15, 2025 12:06

#245 Pckg_SDAResources_agsrs_H1 removal

e3e1ceb

Merge branch 'dev' of github.com:ices-tools-dev/RDBEScore into dev

7587dd1

update species names

1511476

updated to latest version of RDBES

a524bec

#243 #245

Merge branch 'dev' of https://github.com/ices-tools-dev/RDBEScore int…

87021f5

…o dev

#245 removing unused datasets from SDA_Resources

5050cc6

parameter validate was missing from preamble

250517a

Intermediate version with dplyr

97812a7

Merge branch 'dev' of github.com:ices-tools-dev/RDBEScore into dev

bd75995

Merge branch 'dev' of github.com:ices-tools-dev/RDBEScore into dev

1475fb9

file renaming

3ce47fd

vignettes should not start with number #244

Merge branch 'dev' of github.com:ices-tools-dev/RDBEScore into dev

03e020b

Upd Ratio Est fun

ea3a535

Restore to Nuno version

a16e441

Merge branch 'dev' of github.com:ices-tools-dev/RDBEScore into dev

d6b5bd2

delete doumentation

962e8d8

Merge branch 'dev' of github.com:ices-tools-dev/RDBEScore into dev

473be0f

Merge branch 'dev' of github.com:ices-tools-dev/RDBEScore into dev

80cd07e

#245 update manuals

9cd5504

Upd param ratio est

4b99fc4

Merge branch 'dev' of https://github.com/ices-tools-dev/RDBEScore int…

2769b7d

…o dev

because .N is a data.table function to count #244

4315ec9

fix documentation #244

419c804

rix133 requested review from Copilot and erosquesada October 15, 2025 13:35

Copilot AI reviewed Oct 15, 2025

erosquesada approved these changes Oct 15, 2025

rix133 merged commit 537dbce into main Oct 15, 2025
15 checks passed

rix133 mentioned this pull request Oct 15, 2025

createRDBESEstObject() uses a lot of memory #233

Closed

rix133 added a commit that referenced this pull request Oct 15, 2025

Merge pull request #250 from ices-tools-dev/main

0e59118

Merge pull request #244 from ices-tools-dev/dev
