AI-6670 Add recommended monitors for NiFi integration#23303
AI-6670 Add recommended monitors for NiFi integration#23303philjlee wants to merge 1 commit intophilip.lee/AI-6668-nififrom
Conversation
5 monitors covering key NiFi operational scenarios: - can_connect: service check for API reachability - jvm_heap_utilization: JVM heap > 85% (warn 75%) - connection_backpressure: backpressure > 80% (warn 60%) - content_repo_utilization: content repo disk > 85% (warn 75%) - invalid_processors: any invalid processor count > 0 Thresholds calibrated against 406 existing monitors in the repo.
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: b6d4b5f69d
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
| "critical": 0.85, | ||
| "warning": 0.75 |
There was a problem hiding this comment.
Use percent-scale thresholds for NiFi utilization alerts
These thresholds are on the wrong scale for NiFi metrics: the check submits utilization/backpressure as 0..100 percentages (for example _parse_utilization('16.0%') -> 16.0 in nifi/datadog_checks/nifi/check.py, and unit tests assert nifi.system.jvm.heap_utilization=20.0 and nifi.connection.percent_use_count=10), but this monitor uses 0.85/0.75 (and similarly 0.8/0.6 in other new monitor files). That makes the alert trigger above 0.85%, so it will stay in alert for almost any healthy host and produce persistent false positives.
Useful? React with 👍 / 👎.
Wow, I got this very wrong! Let me close this PR and add the assets the correct way. |
Jira: AI-6670
Epic: AI-6662
Stacks on: #23110
Summary
Monitors
nifi.can_connectnifi.system.jvm.heap_utilizationnifi.connection.percent_use_countnifi.system.content_repo.utilizationnifi.flow.invalid_countThreshold rationale
Test plan
ddev validate config -s nifipassesddev validate models -s nifipasses