Seeing these vulnerability issues on Dogfood every night #25090

getvictor · 2025-01-02T16:12:07Z

@lucasmrod : It does seem benign, so we should log error as debug and continue the for loop.

fleet/server/vulnerabilities/msrc/sync.go

Line 106 in 7ac39e2

if err := fsClient.Delete(d); err != nil {

Repro/QA

Enroll two hosts with different builds of the same version of Windows.
Run the vulnerabilities cron to pull the appropriate MSRC file.
Rename the MSRC file to match yesterday's date.
Run the vulnerabilities cron again.

Repro will show the above error. Fixed version won't.

iansltx · 2025-01-15T12:29:38Z

Digging deeper on this, as the deletion should only be picking up files that exist, so if we're attempting to delete files that we can't find when deleting something else may be at play. Still troubleshooting this.

For #25090. Not 100% sure why we're seeing this issue but this will drop the error severity, and even if a delete fails and leaves a file we'll pick the correct (latest) MSRC file for each OS anyway, so this is low-risk.

iansltx · 2025-03-11T06:38:01Z

@rfairburn Why might these files be getting deleted between the time we calculate a delta between files to download/delete and actually deleting the files? We're consistently seeing this once per day, with one file failing to delete on one hour (~1:22a UTC) and one failing to delete the next hour (~2:22a UTC). Given that we run the vulns cron hourly, this seems odd. Before downgrading the error (see the associated PR), I want to know why we're seeing this.

rfairburn · 2025-03-11T08:03:44Z

Does Dogfood already have the fix that prevents alerts from repeating every time a cron runs until the service is restarted? I'm pretty sure that made it into the RC, but not sure if that version of the RC has been applied to Dogfood yet.

This could have been a one-off thing for any number of reasons (we don't have persistent or shared storage at all for example as containers are intended to be stateless as much as possible), but would alert every cron interval with the same error if the alerting fix has not been deployed yet.

iansltx · 2025-03-11T15:01:41Z

The info above was from CloudWatch Logs, not alerts, and it only happens 2x per day (but happens 2x every day), so I don't think it's one-off, nor is it related to the the repeating alerts thing...I think?

…ilds of the same version of Windows For #25090.

iansltx · 2025-03-12T04:25:59Z

@rfairburn Your theory on this being due to multiple matches to the same MSRC file to delete was a sound one. Different builds of the same version of Windows is the culprit here (which is why you didn't see this in every environment). The included PR fixes that issue; thanks for the assist here!

…ilds of the same version of Windows (#27060) For #25090. # Checklist for submitter If some of the following don't apply, delete the relevant line.  - [x] Changes file added for user-visible changes in `changes/`, `orbit/changes/` or `ee/fleetd-chrome/changes`. See [Changes files](https://github.com/fleetdm/fleet/blob/main/docs/Contributing/Committing-Changes.md#changes-files) for more information. - [x] Input data is properly validated, `SELECT *` is avoided, SQL injection is prevented (using placeholders for values in statements) - [x] Added/updated automated tests - [x] A detailed QA plan exists on the associated ticket (if it isn't there, work with the product group's QA engineer to add it) - [x] Manual QA for all new/changed functionality

iansltx added the ~backend Backend-related issue. label Jan 2, 2025

mostlikelee assigned iansltx Jan 2, 2025

iansltx added this to the 4.63.0-tentative milestone Jan 2, 2025

mostlikelee removed this from the 4.63.0-tentative milestone Jan 2, 2025

mostlikelee added this to the 4.64.0-tentative milestone Jan 15, 2025

iansltx modified the milestones: 4.64.0, 4.65.0-tentative Feb 4, 2025

mostlikelee removed this from the 4.65.0-tentative milestone Feb 5, 2025

lukeheath added :product Product Design department (shows up on 🦢 Drafting board) and removed :release Ready to write code. Scheduled in a release. See "Making changes" in handbook. labels Feb 7, 2025

mostlikelee unassigned iansltx Feb 10, 2025

mostlikelee assigned ksykulev Feb 20, 2025

mostlikelee added :release Ready to write code. Scheduled in a release. See "Making changes" in handbook. and removed :product Product Design department (shows up on 🦢 Drafting board) labels Feb 20, 2025

mostlikelee assigned iansltx and unassigned ksykulev Feb 26, 2025

iansltx mentioned this issue Mar 11, 2025

Log rather than erroring when MSRC feed delete fails #27021

Closed

4 tasks

iansltx added a commit that referenced this issue Mar 12, 2025

Dedupe MSRC downloads/deletes when enrolled hosts include multiple bu…

9926bc6

…ilds of the same version of Windows For #25090.

iansltx mentioned this issue Mar 12, 2025

Dedupe MSRC downloads/deletes when enrolled hosts include multiple builds of the same version of Windows #27060

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Seeing these vulnerability issues on Dogfood every night #25090

Seeing these vulnerability issues on Dogfood every night #25090

getvictor commented Jan 2, 2025 •

edited by iansltx

Loading

iansltx commented Jan 15, 2025

iansltx commented Mar 11, 2025

rfairburn commented Mar 11, 2025

iansltx commented Mar 11, 2025

iansltx commented Mar 12, 2025

Seeing these vulnerability issues on Dogfood every night #25090

Seeing these vulnerability issues on Dogfood every night #25090

Comments

getvictor commented Jan 2, 2025 • edited by iansltx Loading

Repro/QA

iansltx commented Jan 15, 2025

iansltx commented Mar 11, 2025

rfairburn commented Mar 11, 2025

iansltx commented Mar 11, 2025

iansltx commented Mar 12, 2025

getvictor commented Jan 2, 2025 •

edited by iansltx

Loading