@andreasnoack andreasnoack commented Oct 30, 2025

This implements the confidence interval as described in Cleveland and Grosse (1991), except for the statistical approximation of the deltas described in section 4 of the paper. They don't seem to share the coefficients of their fit, so it is not easy to implement that part. In addition, computers have much more memory these days, so I don't think the big matrix is a problem in most cases. I'd be interested to hear if anybody knows about more recent approaches to calculating the deltas that avoid the big matrix.
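For reference, the interval construction from the paper can be sketched on toy data. This is a simplified stand-in, not the implementation in this PR: it builds the full n×n smoother ("hat") matrix with a plain local-linear tricube fit, computes the paper's δ₁ and δ₂ trace quantities, and uses a normal quantile in place of the paper's t-distribution lookup.

```julia
using LinearAlgebra, Random

# Tricube weight used by loess
tricube(u) = abs(u) < 1 ? (1 - abs(u)^3)^3 : 0.0

# Row of the hat matrix for a local-linear fit at x0 using the q nearest points
function hatrow(x::Vector{Float64}, x0::Float64, q::Int)
    d = abs.(x .- x0)
    h = sort(d)[q]                          # bandwidth: distance to q-th nearest point
    w = tricube.(d ./ h)
    X = [ones(length(x)) (x .- x0)]         # local linear design, centered at x0
    W = Diagonal(w)
    # fitted value at x0 is l' * y with l' = e1' * (X'WX)^{-1} X'W
    return ((X' * W * X) \ (X' * W))[1, :]
end

Random.seed!(42)
n = 200
x = sort(rand(n))
y = sqrt.(x) .+ 0.05 .* randn(n)
q = round(Int, 0.75n)                       # span * n nearest neighbours

L = reduce(vcat, (hatrow(x, xi, q)' for xi in x))   # the "big matrix"
ŷ = L * y
r = y .- ŷ

# Cleveland & Grosse (1991): δ1 = tr(B'B), δ2 = tr((B'B)^2) with B = I - L
B = I - L
BtB = B' * B
δ1 = tr(BtB)
δ2 = tr(BtB * BtB)
σ̂² = sum(abs2, r) / δ1                      # residual variance estimate
se = sqrt.(σ̂² .* vec(sum(abs2, L, dims = 2)))  # se of fitted value: σ̂ * ‖lᵢ‖

# ~95% pointwise bands; the paper uses a t with δ1²/δ2 d.o.f., here ≈ normal
lower = ŷ .- 1.96 .* se
upper = ŷ .+ 1.96 .* se
```

The statistical approximation the PR skips is the fitted shortcut for δ₁ and δ₂; the sketch above pays the O(n²) memory and O(n³) time to compute them exactly.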

I tried to mimic the interface of predict in GLM, but decided to use a struct to return the expensive helper quantities together with the confidence bounds. I'm not a big fan of using Symbols for finite sets of options, as is done here, but that is what GLM currently does. Maybe we should change it, but that is a separate concern.
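The struct-returning approach might look something like the following sketch. All names here (the struct, its fields, and the helper function) are hypothetical illustrations, not the merged Loess.jl API:

```julia
# Hypothetical sketch: bundle point estimates, bounds, and the expensive
# helper quantities in one struct instead of returning a bare vector.
struct LoessPrediction{T<:AbstractFloat}
    prediction::Vector{T}   # point estimates ŷ
    lower::Vector{T}        # lower confidence bound
    upper::Vector{T}        # upper confidence bound
    se::Vector{T}           # pointwise standard errors (helper quantity)
end

# A Symbol-valued keyword would mirror GLM's current interface style,
# e.g. predict(model, xs; interval = :confidence, level = 0.95).
function predict_with_interval(ŷ::Vector{T}, se::Vector{T}; z = 1.96) where {T}
    return LoessPrediction(ŷ, ŷ .- z .* se, ŷ .+ z .* se, se)
end

pred = predict_with_interval([1.0, 2.0, 3.0], [0.1, 0.2, 0.1])
pred.lower   # bounds and helpers travel together in one struct
```

A struct keeps the expensive quantities accessible to callers (e.g. for plotting bands) without recomputation, at the cost of a slightly heavier return type than GLM's plain vectors.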

With this, you can construct this plot:

(Screenshot, 2025-10-29: Loess.jl fit with confidence bands)

The same plot with ggplot2, which uses Cleveland and Grosse's code for the computations, is:

(Screenshot, 2025-10-29: equivalent ggplot2 plot)

Closes #29

@andreasnoack andreasnoack requested a review from palday October 30, 2025 20:24
@andreasnoack andreasnoack force-pushed the an/uncertain branch 2 times, most recently from 3479ce7 to 32fcc1f Compare October 30, 2025 20:30
@andreasnoack
Member Author

The new implementation stores the rows of the hat matrix (and a bit more) for each vertex when constructing the Loess fit. That requires much more memory than the old implementation, which causes the benchmark runs to run out of memory because they test relatively large problems, see

```julia
for i in 2:6
    n = 10^i
    x = rand(MersenneTwister(42), n)
    y = sqrt.(x)
    SUITE["random"][string(n)] = @benchmarkable loess($x, $y)
end
```

It looks like R avoids storing these rows in the loess function and only constructs them when you call predict on the loess object, but then you pay the price of essentially recomputing all the local fits in predict. In my opinion, the main use of Loess these days is visualization with uncertainty bounds, so I'm leaning towards just accepting that the implementation can't handle problems as large as before.
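A rough back-of-envelope shows why storing hat-matrix rows blows up at the benchmark sizes. The numbers below are purely illustrative (the actual storage depends on the number of kd-tree vertices, which the sketch just fixes at a hypothetical constant):

```julia
# Each stored hat-matrix row holds one Float64 weight per observation,
# so storage is roughly nvertices * n * 8 bytes.
hatrow_bytes(n, nvertices) = nvertices * n * 8

# Illustrative only: assume a fixed 200 vertices regardless of n
for n in (10^4, 10^5, 10^6)
    gb = hatrow_bytes(n, 200) / 2^30
    println("n = $n: ≈ $(round(gb, digits = 3)) GiB for stored rows")
end
```

At n = 10⁶ even this conservative assumption lands in the gigabyte range before counting intermediates, which is consistent with the CI runner (about 15 GiB of RAM) running out of memory.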

@github-actions

github-actions bot commented Nov 5, 2025

Benchmark Report for /home/runner/work/Loess.jl/Loess.jl

Job Properties

  • Time of benchmarks:
    • Target: 28 Nov 2025 - 06:46
    • Baseline: 28 Nov 2025 - 06:47
  • Package commits:
  • Julia commits:
    • Target: ca9b666
    • Baseline: ca9b666
  • Julia command flags:
    • Target: None
    • Baseline: -C,native,-J/opt/hostedtoolcache/julia/1.12.2/x64/lib/julia/sys.so,-g1,-O3,-e,using Pkg; Pkg.update()
  • Environment variables:
    • Target: None
    • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less than 1.0 denotes a possible improvement (marked with ✅). Brackets display tolerances for the benchmark estimates. Only significant results (those indicating possible regressions or improvements) are shown below; an empty table means that all benchmark results remained invariant between builds.

| ID | time ratio | memory ratio |
|:---|:-----------|:-------------|
| `["random", "100"]` | 1.56 (5%) ❌ | 0.33 (1%) ✅ |
| `["random", "1000"]` | 1.44 (5%) ❌ | 1.16 (1%) ❌ |
| `["random", "10000"]` | 1.34 (5%) ❌ | 1.88 (1%) ❌ |
| `["random", "100000"]` | 1.28 (5%) ❌ | 2.00 (1%) ❌ |
| `["random", "1000000"]` | 1.27 (5%) ❌ | 2.02 (1%) ❌ |
| `["ties", "sine"]` | 1.39 (5%) ❌ | 1.77 (1%) ❌ |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["random"]
  • ["ties"]

Julia versioninfo

Target

Julia Version 1.12.2
Commit ca9b6662be4 (2025-11-20 16:25 UTC)
Build Info:
  Official https://julialang.org release
Platform Info:
  OS: Linux (x86_64-linux-gnu)
      Ubuntu 24.04.3 LTS
  uname: Linux 6.11.0-1018-azure #18~24.04.1-Ubuntu SMP Sat Jun 28 04:46:03 UTC 2025 x86_64 x86_64
  CPU: AMD EPYC 7763 64-Core Processor: 
              speed         user         nice          sys         idle          irq
       #1     0 MHz        238 s          0 s         46 s       1224 s          0 s
       #2     0 MHz        772 s          0 s         56 s        674 s          0 s
       #3     0 MHz        266 s          0 s         63 s       1158 s          0 s
       #4     0 MHz        265 s          0 s         61 s       1161 s          0 s
  Memory: 15.620681762695312 GB (14268.80078125 MB free)
  Uptime: 159.12 sec
  Load Avg:  1.15  0.52  0.2
  WORD_SIZE: 64
  LLVM: libLLVM-18.1.7 (ORCJIT, znver3)
  GC: Built with stock GC
Threads: 1 default, 1 interactive, 1 GC (on 4 virtual cores)

Baseline

Julia Version 1.12.2
Commit ca9b6662be4 (2025-11-20 16:25 UTC)
Build Info:
  Official https://julialang.org release
Platform Info:
  OS: Linux (x86_64-linux-gnu)
      Ubuntu 24.04.3 LTS
  uname: Linux 6.11.0-1018-azure #18~24.04.1-Ubuntu SMP Sat Jun 28 04:46:03 UTC 2025 x86_64 x86_64
  CPU: AMD EPYC 7763 64-Core Processor: 
              speed         user         nice          sys         idle          irq
       #1     0 MHz        258 s          0 s         48 s       1679 s          0 s
       #2     0 MHz        773 s          0 s         57 s       1149 s          0 s
       #3     0 MHz        443 s          0 s         66 s       1457 s          0 s
       #4     0 MHz        541 s          0 s         64 s       1361 s          0 s
  Memory: 15.620681762695312 GB (13977.6875 MB free)
  Uptime: 207.0 sec
  Load Avg:  1.06  0.6  0.25
  WORD_SIZE: 64
  LLVM: libLLVM-18.1.7 (ORCJIT, znver3)
  GC: Built with stock GC
Threads: 1 default, 1 interactive, 1 GC (on 4 virtual cores)

@andreasnoack
Member Author

I've reduced the problem size of the benchmarks so they don't exceed 10,000 elements, and the run now goes through without hitting the memory limit. The overall runtime is slower, which shouldn't be a surprise given that more work is being done and more allocations are required to be able to compute the uncertainty of the predictions. The question is whether there is a need for very large or very quick Loess computations where the uncertainty calculation is just a nuisance.

@palday
Member

palday commented Nov 10, 2025

> The question is whether there is a need for very large or very quick Loess computations where the uncertainty calculation is just a nuisance.

Sorry for the delay in following up here, but this is exactly my biggest concern. I have a use case where we put a smooth line on a plot with a very large number of points, but I haven't had a chance to test the performance here on that. I also sometimes use a smooth line on plots of Bernoulli data; the smooth gives a good first impression of how the probability changes across different x values. I'll try to find some time to test on these big examples this week!
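The Bernoulli use case can be illustrated with a plain local-mean smoother. This is a crude, dependency-free stand-in for a loess smooth (all names here are hypothetical): smoothing 0/1 outcomes against x recovers the underlying probability curve.

```julia
using Random, Statistics

Random.seed!(1)
n = 5_000
x = rand(n)
p = @. 1 / (1 + exp(-4 * (x - 0.5)))   # true P(y = 1 | x), a logistic curve
y = Float64.(rand(n) .< p)             # Bernoulli outcomes, coded 0.0 / 1.0

# Local mean over a fixed window: a stand-in for a loess smooth at x0
function localmean(x, y, x0; h = 0.05)
    idx = findall(xi -> abs(xi - x0) < h, x)
    return mean(y[idx])
end

p̂ = localmean(x, y, 0.5)               # should be close to the true p(0.5) = 0.5
```

The same idea is why a loess line on raw 0/1 data works as a first look at a probability: the local fit is a weighted local average of the outcomes.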

@andreasnoack
Member Author

@palday It would indeed be good if you could try out some of your larger examples. If we conclude that supporting large problems without uncertainty is important, I can see a couple of ways forward. I'm not particularly happy with any of them, so my preference is for the current version, but I'm of course biased by my use cases. The alternatives I see are:

  1. Compute the hat matrix rows in predict. This essentially means redoing in predict all the fitting done in fit. I think this is what R does.
  2. Use sparse representations of the F matrices. For this to be useful, we'd have to lower the value of span quite a bit. Otherwise, it won't reduce the storage.
  3. Set a flag in fit to signal whether the fit allows for uncertainty calculations, and error out in predict if needed.
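The third option could be as simple as making the stored rows optional. A minimal sketch, with entirely hypothetical type, field, and keyword names:

```julia
# Hypothetical sketch of option 3: hat-matrix rows become optional state.
struct ToyLoessModel
    ŷ::Vector{Float64}
    hatrows::Union{Nothing, Matrix{Float64}}  # nothing when fit with uncertainty = false
end

# fit(...; uncertainty = true) would populate hatrows; large problems pass false.
function toy_predict(m::ToyLoessModel; interval::Union{Nothing, Symbol} = nothing)
    if interval === :confidence && m.hatrows === nothing
        error("model was fit without uncertainty information; refit with uncertainty = true")
    end
    return m.ŷ
end

m = ToyLoessModel([1.0, 2.0], nothing)
toy_predict(m)                            # fine: point predictions only
# toy_predict(m; interval = :confidence)  # would throw the error above
```

This keeps the fast, low-memory path available for huge inputs while making the failure mode explicit rather than silently dropping the bounds.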

@andreasnoack andreasnoack force-pushed the an/uncertain branch 4 times, most recently from 9a2d0f3 to 730608d Compare November 11, 2025 13:54
@andreasnoack
Member Author

@devmotion thanks for the comments. I believe I've addressed all of them now.

@andreasnoack
Member Author

@palday did you get a chance to run your examples? I'm quite eager to get error bounds in Loess plots, so it would be good to find a path forward here.

@andreasnoack
Member Author

@palday bump. Do you object to this or are you okay with moving forward here?

andreasnoack and others added 8 commits November 25, 2025 20:16
Co-authored-by: David Müller-Widmann <devmotion@users.noreply.github.com>
Co-authored-by: David Müller-Widmann <devmotion@users.noreply.github.com>
Co-authored-by: David Müller-Widmann <devmotion@users.noreply.github.com>
Co-authored-by: David Müller-Widmann <devmotion@users.noreply.github.com>
andreasnoack and others added 7 commits November 25, 2025 20:16
And some other cleanup after merging review comments
Co-authored-by: David Müller-Widmann <devmotion@users.noreply.github.com>
Also, use norm instead of sqrt(sum(abs2,...))
Co-authored-by: David Müller-Widmann <devmotion@users.noreply.github.com>
@andreasnoack
Member Author

@devmotion it looks like the benchmarks now complete but then fail afterwards with an error about the repo being dirty. Have you seen that error before?

@devmotion
Member

No, I have never seen this error. I wonder if it could be related to either the GHA update or the 1.12.2 release.

@devmotion
Member

devmotion commented Nov 26, 2025

Ah, I think it could be JuliaCI/PkgBenchmark.jl#135 which was caused by a Project.toml file that Pkg automatically reformatted, and hence the repo seemed to be dirty in CI.

Edit: Although the Project.toml seems fine to me...

@devmotion
Member

Generally, maybe the benchmark setup would be a bit simpler with AirspeedVelocity.jl. But of course either approach should be fine.

@andreasnoack andreasnoack force-pushed the an/uncertain branch 3 times, most recently from ae37d76 to 338a264 Compare November 27, 2025 12:53
@andreasnoack
Member Author

@devmotion your intuition was right here. With Julia 1.12.2, the Pkg.dev command added a sources line to the Project.toml. For now, I'm working around the error by committing the change before running the benchmarks. Given that it is still possible to run the n = 100000 benchmark with only a modest time penalty, I suggest we go ahead and merge this one. To give some perspective: the n = 100000 example runs in 75 seconds in R, which uses a derived version of the original Fortran implementation; with the changes in this PR, the same problem runs in 63 milliseconds.

@andreasnoack andreasnoack force-pushed the an/uncertain branch 3 times, most recently from 338a264 to 1a08272 Compare November 27, 2025 20:04
@andreasnoack andreasnoack merged commit 766779d into master Nov 28, 2025
5 checks passed
@andreasnoack andreasnoack deleted the an/uncertain branch November 28, 2025 06:50
@andreasnoack
Member Author

cc @bjarthur as this might be useful in Gadfly.

@bjarthur
Contributor

Gadfly needs a new maintainer, and has for a while. I have switched to Makie going forward.


Development

Successfully merging this pull request may close these issues.

ENH: Add confidence intervals / standard errors?