Closed
Description
Create a package dataset for Debian/Ubuntu that maps file names to the package(s) that could have installed them. Relates to #5 and #8 for determining how file names should be normalized for use as a lookup "key". We should also consider how to split the dataset into smaller chunks based on how it will be used (e.g. only includes, only binary files, etc.).
Some potential sources of data for this are:
- SQLite database from linux-package-analyzer
- Contents-amd64 gzipped file from http://security.ubuntu.com/ubuntu/dists/ for noble or oracular (sources also may have interesting info to add); Debian equivalent is https://ftp.debian.org/debian/dists/stable-updates/main/
- Caveat: the package name given in these sources isn't always the best one (it usually contains version information); in general the source package name is a better, more recognizable option. There are exceptions, though: libstdc++-12-dev comes from the gcc-12 source package, so a few special cases for well-known libraries like that may be needed.
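
As a rough sketch of how the Contents-amd64 source could feed the dataset, the snippet below builds a file name to package(s) mapping. It assumes each line is a path, then whitespace, then a comma-separated list of qualified package names such as `section/name`, and that the file has no free-text header (older Contents files did have one, which would need skipping). The function names `parse_contents` and `load_contents` are hypothetical, and no file name normalization from #5 or #8 is applied here.

```python
import gzip
from collections import defaultdict


def parse_contents(lines):
    """Map each file path to the set of binary package names listing it.

    Assumed line format (not verified against every release):
        <path><whitespace><qualified-name>[,<qualified-name>...]
    """
    index = defaultdict(set)
    for line in lines:
        # Split from the right: paths may contain spaces, the package list does not.
        parts = line.rsplit(None, 1)
        if len(parts) != 2:
            continue  # blank or malformed line
        path, locations = parts
        for location in locations.split(","):
            # Drop the area/section prefix and keep only the package name.
            index[path.strip()].add(location.rsplit("/", 1)[-1])
    return index


def load_contents(filename):
    """Parse a gzipped Contents file that has already been downloaded locally."""
    with gzip.open(filename, "rt", encoding="utf-8", errors="replace") as fh:
        return parse_contents(fh)
```

For example, `load_contents("Contents-amd64.gz")["usr/bin/gcc-12"]` would return the set of binary package names listing that path. Mapping those binary names to source package names, including special cases like libstdc++-12-dev, would be a separate step.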