uniq: fix -w to count bytes in C locale by aguimaraes · Pull Request #11061 · uutils/coreutils

aguimaraes · 2026-02-23T01:46:19Z

Summary

uniq -w N should count bytes in C/POSIX locale and characters in UTF-8 locale. Currently it always counts UTF-8 characters regardless of locale.

Changes

Added is_c_locale() helper that checks LC_ALL, LC_CTYPE, LANG in order
Modified key_end_index() to use byte counting when in C locale
Added test for C locale byte counting behavior
Fixed test_stdin_w1_multibyte to explicitly set UTF-8 locale (it was implicitly relying on character counting)

Considerations

I chose to inline the locale check (~9 lines) rather than adding the i18n feature dependency. The check is simple enough that duplicating it seemed better than pulling in ICU dependencies just for this.

If you'd prefer I use uucore::i18n instead, let me know and I'll update.

Fixes #10184

github-actions · 2026-02-23T02:06:48Z

GNU testsuite comparison:

Skipping an intermittent issue tests/pr/bounded-memory (passes in this run but fails in the 'main' branch)

src/uu/uniq/src/uniq.rs

github-actions · 2026-02-23T04:18:41Z

GNU testsuite comparison:

Skipping an intermittent issue tests/pr/bounded-memory (passes in this run but fails in the 'main' branch)

aguimaraes added 2 commits February 22, 2026 20:38

uniq: fix -w to count bytes in C locale

7f07583

uniq: add CTYPE to spellchecker ignore list

85fa5b9

xtqqczze reviewed Feb 23, 2026

View reviewed changes

src/uu/uniq/src/uniq.rs Show resolved Hide resolved

uniq: avoid String allocation by using std::env::var_os()

1bc1507

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

uniq: fix -w to count bytes in C locale#11061

uniq: fix -w to count bytes in C locale#11061
aguimaraes wants to merge 3 commits intouutils:mainfrom
aguimaraes:uniq-fix-w-locale-bytes

aguimaraes commented Feb 23, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 23, 2026

Uh oh!

Uh oh!

github-actions bot commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Comments

Conversation

aguimaraes commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Considerations

Uh oh!

github-actions bot commented Feb 23, 2026

Uh oh!

Uh oh!

github-actions bot commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

aguimaraes commented Feb 23, 2026 •

edited

Loading