DevOps-Project-41 by oleitao · Pull Request #35 · NotHarshhaa/DevOps-Projects

oleitao · 2026-05-22T22:26:38Z

Summary by CodeRabbit

New Features
- Async AI job API (POST /ask) + job status (GET /jobs/{jobId}), background worker, pluggable AI providers, health/readiness endpoints, Prometheus metrics and Grafana dashboard.
Chores
- Local Docker Compose, multi-stage Dockerfile, Kubernetes manifests (dev/prod overlays), KEDA autoscaling, hardened network/Kyverno policies, .gitignore, CI/release/security workflows with image signing and SBOM generation.
Tests
- Unit & contract tests, executable smoke script, k6 load test.
Documentation
- README, architecture, setup, observability, security, troubleshooting, validation and cleanup guides.

coderabbitai · 2026-05-22T22:26:51Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 2aed1576-99e6-44ac-9c0e-a5f0ddfdade6

📥 Commits

Reviewing files that changed from the base of the PR and between 4144428 and e2e0b5b.

📒 Files selected for processing (2)

.github/workflows/project-41-ci.yml
.github/workflows/project-41-release.yml

📝 Walkthrough

Walkthrough

Adds a .NET 8 AI job platform: ASP.NET API, background worker, three AI providers, PostgreSQL/Redis persistence, Docker and Kubernetes manifests, observability, Kyverno policies, tests, load/smoke scripts, and GitHub Actions CI/CD, release, and security workflows.

Changes

AI Platform Full Stack

Layer / File(s)	Summary
Docker Build, Compose & Project config `DevOps-Project-41/app/Dockerfile`, `DevOps-Project-41/app/docker-compose.yml`, `DevOps-Project-41/app/AiPlatform.sln`, `DevOps-Project-41/app/.dockerignore`, `.gitignore`	Multi-stage .NET 8 Dockerfile producing api/worker runtime images; docker-compose for local stack (api, worker, Redis, Postgres, OTEL, Prometheus, Grafana); solution and ignore configs added.
AI Provider Abstraction & Implementations `DevOps-Project-41/app/src/AiProvider/*`	`IAiProvider` contract plus `MockLlmProvider`, `OllamaProvider`, and `OpenAiCompatibleProvider` implementations and provider project config.
API Models & Persistence `DevOps-Project-41/app/src/AiApi/Models.cs`, `DevOps-Project-41/app/src/AiApi/JobRepository.cs`, `DevOps-Project-41/app/src/AiApi/AiApi.csproj`	Request/response records and `JobRepository` for ai_jobs table (ensure schema, insert, fetch, connectivity).
API Endpoints & App Host `DevOps-Project-41/app/src/AiApi/Program.cs`, `DevOps-Project-41/app/src/AiApi/appsettings.json`	ASP.NET Core app wiring: Redis, JobRepository DI, OpenTelemetry/Prometheus, `/ask` and `/jobs/{jobId}` endpoints, health/readiness checks.
Worker Persistence & Updater `DevOps-Project-41/app/src/AiWorker/JobUpdater.cs`	`JobUpdater` updates job lifecycle states (processing/completed/failed) in PostgreSQL with parameterized Npgsql commands.
Worker Service & Host `DevOps-Project-41/app/src/AiWorker/Worker.cs`, `DevOps-Project-41/app/src/AiWorker/Program.cs`, `DevOps-Project-41/app/src/AiWorker/AiWorker.csproj`, `DevOps-Project-41/app/src/AiWorker/appsettings.json`	BackgroundService polling Redis, invoking `IAiProvider`, recording traces/metrics, updating job state; host selects provider by `AI_PROVIDER`, configures OpenTelemetry and Prometheus metric server.
Tests: unit, contract, load, smoke `DevOps-Project-41/app/tests/*`, `DevOps-Project-41/tests/load/k6-ai-jobs.js`, `DevOps-Project-41/tests/smoke/smoke-test.sh`	xUnit tests for mock provider and provider contracts; k6 load script and bash smoke-test for end-to-end verification.
Kubernetes Manifests & Overlays `DevOps-Project-41/k8s/base/`, `DevOps-Project-41/k8s/overlays/`	Kustomize base and dev/prod overlays: Deployments, Services, Namespace, ConfigMap, Secrets example, Postgres StatefulSet, Redis, KEDA ScaledObject, NetworkPolicies, ServiceAccount.
Observability `DevOps-Project-41/observability/*`	OTel Collector configs, Prometheus configs/values, Loki values, Grafana dashboard JSON for platform metrics.
Security: SBOM, Trivy, Cosign, Kyverno `DevOps-Project-41/security/*`	Trivy config, SBOM and Cosign docs, Kyverno ClusterPolicies to enforce non-root, resource limits, labels, deny privileged, and disallow :latest tags.
CI/CD & Security Workflows `.github/workflows/project-41-ci.yml`, `.github/workflows/project-41-release.yml`, `.github/workflows/project-41-security.yml`, plus `DevOps-Project-41/.github/workflows/*`	CI workflow for build/test and Docker Buildx smoke builds; release workflow builds/pushes GHCR images, keyless Cosign signing, SBOM generation/attestation, and verification; security workflow runs Trivy filesystem/config/image scans and uploads SARIF artifacts.
Documentation & Guides `DevOps-Project-41/README.md`, `architecture/`, `docs/`, `infra/terraform/README.md`, `gitops/*`	Comprehensive architecture, setup (local/cloud), observability, security, gitops, troubleshooting, validation checklist, cleanup, and Terraform README.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Poem

🐰 I hopped through code and spun a test,
Queues and traces do their best,
Workers hum and APIs cheer,
Signed and scanned, the release is near—
A carrot-shaped CI trail, sincere!

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

coderabbitai

Actionable comments posted: 20

🧹 Nitpick comments (4)

DevOps-Project-41/app/src/AiApi/JobRepository.cs (1)

22-25: ⚡ Quick win

Enforce valid job states at the database layer.

status is unconstrained text, so invalid lifecycle values can be persisted. Add a CHECK constraint to protect data integrity.

Proposed fix

             CREATE TABLE IF NOT EXISTS ai_jobs (
                 job_id      TEXT PRIMARY KEY,
-                status      TEXT NOT NULL DEFAULT 'queued',
+                status      TEXT NOT NULL DEFAULT 'queued'
+                            CHECK (status IN ('queued','processing','completed','failed')),
                 prompt      TEXT NOT NULL,

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/app/src/AiApi/JobRepository.cs` around lines 22 - 25, The
ai_jobs table currently allows any text in the status column; update the CREATE
TABLE SQL in JobRepository.cs so the status column includes a CHECK constraint
limiting values to the canonical lifecycle set (e.g. CHECK(status IN
('queued','running','completed','failed','cancelled'))); if the DB may already
exist, add a migration path that issues an ALTER TABLE ai_jobs ADD CONSTRAINT
... CHECK(...) to enforce the same allowed values for existing databases.

DevOps-Project-41/app/tests/AiApi.Tests/MockLlmProviderTests.cs (1)

45-58: 💤 Low value

Consider including "gitops" in the theory test for consistency.

The theory test covers four keywords (kubernetes, devsecops, keda, opentelemetry) but omits gitops, which is tested separately in lines 12-22. Including all five keywords in the theory would provide more consistent coverage and reduce test duplication.
Suggested change
 [Theory]
+[InlineData("gitops")]
 [InlineData("kubernetes")]
 [InlineData("devsecops")]
 [InlineData("keda")]
 [InlineData("opentelemetry")]
 public async Task CompleteAsync_RecognisesAllKeywords(string keyword)
You could then remove or simplify the CompleteAsync_ReturnsResponse_ForKnownKeyword test if desired, or keep it for its specific assertions about GitOps content.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/app/tests/AiApi.Tests/MockLlmProviderTests.cs` around lines
45 - 58, The theory test CompleteAsync_RecognisesAllKeywords currently omits
"gitops", causing redundant separate test
CompleteAsync_ReturnsResponse_ForKnownKeyword; update
CompleteAsync_RecognisesAllKeywords to include [InlineData("gitops")] so the
AiRequest cases cover all keywords, then remove or simplify the separate
CompleteAsync_ReturnsResponse_ForKnownKeyword test if desired; ensure you
reference the same _provider.CompleteAsync(request) call and existing assertions
(Assert.NotEmpty and Assert.DoesNotContain) so behavior remains identical.

DevOps-Project-41/app/src/AiProvider/IAiProvider.cs (1)

9-11: 💤 Low value

Consider validating non-empty strings in records.

The AiRequest and AiResponse records accept non-nullable strings but don't validate that they're non-empty. While nullable reference types prevent null values, callers could still pass empty strings for Prompt, Model, JobId, or other fields. If this is an internal contract between trusted components, the current design may be acceptable. Otherwise, consider adding validation.

Example validation approach

-public record AiRequest(string Prompt, string Model, string JobId);
+public record AiRequest(string Prompt, string Model, string JobId)
+{
+    public AiRequest
+    {
+        ArgumentException.ThrowIfNullOrWhiteSpace(Prompt);
+        ArgumentException.ThrowIfNullOrWhiteSpace(Model);
+        ArgumentException.ThrowIfNullOrWhiteSpace(JobId);
+    }
+}

Similar validation could be added to AiResponse if needed.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/app/src/AiProvider/IAiProvider.cs` around lines 9 - 11,
AiRequest and AiResponse currently accept non-null strings but don't guard
against empty values; add validation in their record primary constructors
(AiRequest and AiResponse) to throw ArgumentException (or ArgumentNullException)
when Prompt, Model, JobId, Content, or Provider are null/empty or whitespace.
Implement this by adding a compact constructor or init-time checks inside the
record types (e.g., validate Prompt/Model/JobId in AiRequest and
JobId/Content/Model/Provider in AiResponse) and include clear parameter names in
the exception messages so callers know which field is invalid.

DevOps-Project-41/app/Dockerfile (1)

47-47: ⚡ Quick win

Use a leaner runtime image for the worker container.

Line 47 uses aspnet runtime for AiWorker, but the worker does not need ASP.NET hosting. Using dotnet/runtime reduces image size and attack surface.

Suggested change

-FROM mcr.microsoft.com/dotnet/aspnet:8.0-alpine AS worker
+FROM mcr.microsoft.com/dotnet/runtime:8.0-alpine AS worker

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/app/Dockerfile` at line 47, Replace the ASP.NET runtime
base image for the Docker build stage named "worker" with the leaner .NET
runtime image: change the FROM line that currently reads
"mcr.microsoft.com/dotnet/aspnet:8.0-alpine AS worker" to use
"mcr.microsoft.com/dotnet/runtime:8.0-alpine AS worker", verify that AiWorker
(the worker stage) has no ASP.NET-specific dependencies, update any
runtime-dependent commands if needed, and rebuild the image to confirm the
smaller runtime image works correctly.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In @.github/workflows/project-41-ci.yml:
- Line 27: The workflow uses floating action tags (e.g., actions/checkout@v4,
actions/setup-dotnet@v4, actions/upload-artifact@v4,
docker/setup-buildx-action@v3, docker/build-push-action@v6) which should be
pinned to immutable commit SHAs; locate the workflow entries that reference
those action identifiers and replace each tag with the corresponding full commit
SHA for that action's repository (use the latest stable commit SHA for each
action version), updating both occurrences of docker/build-push-action@v6;
ensure the `uses:` lines for actions/checkout, actions/setup-dotnet,
actions/upload-artifact, docker/setup-buildx-action, and
docker/build-push-action are all changed to use the full commit SHAs.
- Around line 26-27: The checkout steps using actions/checkout@v4 leave the
runner git credentials persisted; update each checkout step (both occurrences of
the actions/checkout@v4 invocation) to include a with: block setting
persist-credentials: false so the GITHUB_TOKEN is not left in the runner git
config. Ensure both checkout steps are modified (the first and the second
actions/checkout@v4 usages) to include persist-credentials: false.

In @.github/workflows/project-41-release.yml:
- Around line 35-36: The checkout step using actions/checkout@v4 (the step named
"Checkout") currently does not set persist-credentials; update that step to
include persist-credentials: false so the GITHUB_TOKEN is not persisted to the
checked-out repository by default, e.g., add the persist-credentials: false key
under the actions/checkout@v4 step configuration.
- Around line 7-13: The workflow defines a manual input
workflow_dispatch.inputs.tag but never uses it; either remove the unused
inputs.tag or wire it into the image-tagging steps so manual runs can override
the computed tag. Update the job/step that currently derives tags from metadata
to reference github.event.inputs.tag (with a sensible fallback to the metadata
value), or delete the inputs.tag block if you prefer metadata-only tags; ensure
any uses mention workflow_dispatch and inputs.tag (github.event.inputs.tag) so
the input actually controls the produced image tag.
- Line 36: The workflow uses mutable action tags; update each action reference
to an immutable commit SHA: replace actions/checkout@v4,
docker/metadata-action@v5, actions/setup-dotnet@v4,
docker/setup-buildx-action@v3, docker/login-action@v3,
docker/build-push-action@v6, sigstore/cosign-installer@v3,
aquasecurity/trivy-action@0.24.0, and actions/upload-artifact@v4 with their
corresponding pinned commit SHAs (obtain the exact commit for each action
repository and substitute the tag/version string with the full 40-char SHA) so
the workflow uses immutable action versions.

In @.github/workflows/project-41-security.yml:
- Line 27: The workflow uses mutable tags (e.g., actions/checkout@v4,
aquasecurity/trivy-action@0.24.0, github/codeql-action/upload-sarif@v3,
docker/setup-buildx-action@v3, docker/build-push-action@v6,
actions/upload-artifact@v4); replace each `uses:` entry with the corresponding
immutable commit SHA (e.g., actions/checkout@<commit-sha>) by looking up the
exact commit for the referenced tag/release in each action's GitHub repo and
substituting that SHA for the tag so the workflow pins to a fixed revision and
update all occurrences (the three actions/checkout entries and the other listed
actions).

In `@DevOps-Project-41/app/docker-compose.yml`:
- Around line 15-16: The Compose file currently embeds plaintext credentials in
environment variables like POSTGRES_CONNECTION_STRING and other service env
keys; replace those hardcoded values with environment interpolation (e.g., use
${POSTGRES_CONNECTION_STRING} and ${AI_PROVIDER} or secret references) and
require callers to supply them via a .env or orchestration secrets store, update
the compose services that set POSTGRES_CONNECTION_STRING, AI_PROVIDER and any
other hardcoded DB credentials to read from those environment variables/secrets,
and add a note to the project README to provide a template .env (and ensure the
real .env is gitignored) so values are injected at runtime rather than
committed.
- Around line 53-55: Change the open port mappings in docker-compose.yml so
Redis and Postgres bind only to localhost instead of all interfaces: update the
Redis service's ports entry (the ports: block shown with "6379:6379") to use
127.0.0.1 as the host IP, and do the same for the Postgres service ports block
(the ports lines referenced around lines 64–66) so they bind to 127.0.0.1 (e.g.
host:container mapping using 127.0.0.1) to limit exposure in development.

In `@DevOps-Project-41/app/src/AiApi/AiApi.csproj`:
- Around line 17-25: The OpenTelemetry packages are mixed across 1.9.x and
1.15.x lines; align them to a single stable version (e.g., 1.15.3) by updating
PackageReference entries for OpenTelemetry.Extensions.Hosting,
OpenTelemetry.Instrumentation.AspNetCore, OpenTelemetry.Instrumentation.Http,
OpenTelemetry.Instrumentation.Runtime,
OpenTelemetry.Exporter.OpenTelemetryProtocol and
OpenTelemetry.Exporter.Prometheus.AspNetCore so they all use the same 1.15.x
version (use 1.15.3 for exporters and instrumentation and update Runtime from
1.15.1 to 1.15.3) to ensure consistent OpenTelemetry versions.

In `@DevOps-Project-41/app/src/AiApi/appsettings.json`:
- Line 10: The Postgres connection string currently contains plaintext
credentials under the "Postgres" key; remove the hardcoded value and replace it
with a non-sensitive placeholder in appsettings.json, then load the real
connection string from an environment variable or user-secrets at runtime (e.g.,
read via Configuration.GetConnectionString or Configuration["Postgres"] / env
var) so secrets are not stored in source control; update any code that
references the "Postgres" key (startup/Program.cs or wherever the DB is
constructed) to fall back to a safe default or throw a clear error if the env
var/user-secret is missing.

In `@DevOps-Project-41/app/src/AiApi/JobRepository.cs`:
- Around line 11-14: The constructor currently assigns _connectionString with a
hardcoded fallback credential string; remove that insecure default and instead
validate the configuration result of config.GetConnectionString("Postgres") ??
config["POSTGRES_CONNECTION_STRING"], and if it is null or whitespace throw an
exception (e.g., ArgumentException or InvalidOperationException) so the app
fails fast; update the _connectionString assignment in the JobRepository
constructor to only use the configured value and add a clear error message
mentioning missing Postgres connection string.

In `@DevOps-Project-41/app/src/AiApi/Program.cs`:
- Around line 90-107: Postgres insert (repo.InsertJobAsync) can succeed while
Redis enqueue (db.ListLeftPushAsync) fails, leaving an orphaned Job with status
Queued; after calling ListLeftPushAsync if it throws, call a compensating
repository method (e.g., repo.UpdateJobStatusAsync or repo.MarkJobFailedAsync on
the same jobId) to set JobStatus.Failed (or remove the job) and update
metrics/logging (jobsFailed.Inc(), logger.LogError) before returning the error;
implement this cleanup in the try/catch around InsertJobAsync/ListLeftPushAsync
(or adopt an outbox pattern) so InsertJobAsync, ListLeftPushAsync, and the
compensating call are coordinated for jobId.
- Around line 20-35: The tracing pipeline isn't registering the custom
ActivitySource used by ActivitySource("ai-api") and
StartActivity("http.post.ask"), so those activities won't be captured; inside
the WithTracing(...) configuration where WithTracing(t => { ... }) is defined in
Program.cs, call t.AddSource("ai-api") (alongside AddAspNetCoreInstrumentation
and AddHttpClientInstrumentation) so the custom activities are included, leaving
the existing OTLP exporter logic unchanged.

In `@DevOps-Project-41/app/src/AiWorker/appsettings.json`:
- Line 9: The appsettings.json currently contains a sensitive Postgres
connection string under the "Postgres" key; remove the inline credentials and
replace the value with a non-sensitive placeholder, and update the app
configuration to read the real connection string from an environment variable or
user-secrets instead. Specifically, change the "Postgres" entry to a placeholder
value and modify the startup/config code that reads Configuration["Postgres"]
(or whichever code uses the "Postgres" key) to prefer
Environment.GetEnvironmentVariable("POSTGRES_CONNECTION") or the user-secrets
provider so credentials are not stored in source control. Ensure any local dev
instructions or README mention setting POSTGRES_CONNECTION in
environment/user-secrets.

In `@DevOps-Project-41/app/src/AiWorker/JobUpdater.cs`:
- Around line 16-24: The update methods (e.g., MarkProcessingAsync) ignore
ExecuteNonQueryAsync() return value so a missing job_id is silently ignored;
change each updater to capture the affected rows (var affected = await
cmd.ExecuteNonQueryAsync()), and if affected == 0 throw an exception (e.g.,
InvalidOperationException) including the jobId and operation name (or log then
throw) to fail fast and prevent state drift; apply this change to
MarkProcessingAsync and the other update methods that run similar UPDATE
statements.
- Around line 11-13: The constructor currently falls back to a hardcoded inline
connection string via _connectionString = config.GetConnectionString("Postgres")
?? config["POSTGRES_CONNECTION_STRING"] ??
"Host=localhost;Database=aiops;Username=aiops;Password=aiops"; which embeds
credentials; remove that hardcoded default and instead require a configured
connection string (from config.GetConnectionString("Postgres") or
config["POSTGRES_CONNECTION_STRING"]) and throw a clear exception (e.g.,
InvalidOperationException) from the JobUpdater constructor if none is provided
so the app fails fast; update any code that relied on the fallback to expect an
explicit connection string or supply it via environment/configuration/secrets.

In `@DevOps-Project-41/app/src/AiWorker/Program.cs`:
- Around line 31-34: When AI_PROVIDER equals "openai-compatible" the app
currently constructs an OpenAiCompatibleProvider without ensuring OPENAI_API_KEY
exists; update Program.cs to read builder.Configuration["OPENAI_API_KEY"] before
calling CreateHttpClient/OpenAiCompatibleProvider, validate it is non-empty, and
fail fast if missing (log a clear error via loggerFactory.CreateLogger or throw
an InvalidOperationException/terminate startup) so OpenAiCompatibleProvider
initialization never proceeds with a missing key.
- Around line 42-49: The tracing setup in Program.cs doesn’t register the
ActivitySource used by Worker (Worker.cs creates spans with new("ai-worker")),
so add the source to the OpenTelemetry tracer provider by calling
AddSource("ai-worker") on the TracerProviderBuilder inside the existing
builder.Services.AddOpenTelemetry().WithTracing(...) block (i.e., update the
WithTracing lambda that currently configures AddHttpClientInstrumentation and
AddOtlpExporter to also call AddSource("ai-worker")).

In `@DevOps-Project-41/app/src/AiWorker/Worker.cs`:
- Around line 88-94: The current catch in ProcessJobAsync treats all exceptions
as job failures; change it to handle cancellation separately by detecting
OperationCanceledException/TaskCanceledException and rethrowing (or simply have
a dedicated catch for OperationCanceledException that rethrows) so you do not
call _jobUpdater.MarkFailedAsync, JobsFailed.Inc(), or activity.SetStatus for
cancellations; only non-cancellation exceptions should execute the failure path
that logs via _logger.LogError and calls _jobUpdater.MarkFailedAsync(job.JobId,
ex.Message).
- Around line 49-50: The code currently drops malformed queue payloads when
JsonSerializer.Deserialize<JobMessage>(raw!) returns null or throws; update the
Worker (Worker.cs) processing around the deserialization (the var job =
JsonSerializer.Deserialize<JobMessage>(raw!) site) to catch deserialization
exceptions and handle null results: log an error including the raw payload and
exception details, increment/emit a failure metric, and move the bad message to
a dead-letter or failed-state queue (or requeue with backoff) instead of
silently continuing. Ensure you reference JobMessage,
JsonSerializer.Deserialize, and the Worker message-processing method when making
the changes so the payload is not lost without telemetry.

---

Nitpick comments:
In `@DevOps-Project-41/app/Dockerfile`:
- Line 47: Replace the ASP.NET runtime base image for the Docker build stage
named "worker" with the leaner .NET runtime image: change the FROM line that
currently reads "mcr.microsoft.com/dotnet/aspnet:8.0-alpine AS worker" to use
"mcr.microsoft.com/dotnet/runtime:8.0-alpine AS worker", verify that AiWorker
(the worker stage) has no ASP.NET-specific dependencies, update any
runtime-dependent commands if needed, and rebuild the image to confirm the
smaller runtime image works correctly.

In `@DevOps-Project-41/app/src/AiApi/JobRepository.cs`:
- Around line 22-25: The ai_jobs table currently allows any text in the status
column; update the CREATE TABLE SQL in JobRepository.cs so the status column
includes a CHECK constraint limiting values to the canonical lifecycle set (e.g.
CHECK(status IN ('queued','running','completed','failed','cancelled'))); if the
DB may already exist, add a migration path that issues an ALTER TABLE ai_jobs
ADD CONSTRAINT ... CHECK(...) to enforce the same allowed values for existing
databases.

In `@DevOps-Project-41/app/src/AiProvider/IAiProvider.cs`:
- Around line 9-11: AiRequest and AiResponse currently accept non-null strings
but don't guard against empty values; add validation in their record primary
constructors (AiRequest and AiResponse) to throw ArgumentException (or
ArgumentNullException) when Prompt, Model, JobId, Content, or Provider are
null/empty or whitespace. Implement this by adding a compact constructor or
init-time checks inside the record types (e.g., validate Prompt/Model/JobId in
AiRequest and JobId/Content/Model/Provider in AiResponse) and include clear
parameter names in the exception messages so callers know which field is
invalid.

In `@DevOps-Project-41/app/tests/AiApi.Tests/MockLlmProviderTests.cs`:
- Around line 45-58: The theory test CompleteAsync_RecognisesAllKeywords
currently omits "gitops", causing redundant separate test
CompleteAsync_ReturnsResponse_ForKnownKeyword; update
CompleteAsync_RecognisesAllKeywords to include [InlineData("gitops")] so the
AiRequest cases cover all keywords, then remove or simplify the separate
CompleteAsync_ReturnsResponse_ForKnownKeyword test if desired; ensure you
reference the same _provider.CompleteAsync(request) call and existing assertions
(Assert.NotEmpty and Assert.DoesNotContain) so behavior remains identical.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 4966cda9-ebb8-4d11-a8d0-b8176b916b80

📥 Commits

Reviewing files that changed from the base of the PR and between a38cc0f and 1256980.

📒 Files selected for processing (27)

.github/workflows/project-41-ci.yml
.github/workflows/project-41-release.yml
.github/workflows/project-41-security.yml
.gitignore
DevOps-Project-41/app/.dockerignore
DevOps-Project-41/app/AiPlatform.sln
DevOps-Project-41/app/Dockerfile
DevOps-Project-41/app/docker-compose.yml
DevOps-Project-41/app/src/AiApi/AiApi.csproj
DevOps-Project-41/app/src/AiApi/JobRepository.cs
DevOps-Project-41/app/src/AiApi/Models.cs
DevOps-Project-41/app/src/AiApi/Program.cs
DevOps-Project-41/app/src/AiApi/appsettings.json
DevOps-Project-41/app/src/AiProvider/AiProvider.csproj
DevOps-Project-41/app/src/AiProvider/IAiProvider.cs
DevOps-Project-41/app/src/AiProvider/MockLlmProvider.cs
DevOps-Project-41/app/src/AiProvider/OllamaProvider.cs
DevOps-Project-41/app/src/AiProvider/OpenAiCompatibleProvider.cs
DevOps-Project-41/app/src/AiWorker/AiWorker.csproj
DevOps-Project-41/app/src/AiWorker/JobUpdater.cs
DevOps-Project-41/app/src/AiWorker/Program.cs
DevOps-Project-41/app/src/AiWorker/Worker.cs
DevOps-Project-41/app/src/AiWorker/appsettings.json
DevOps-Project-41/app/tests/AiApi.Tests/AiApi.Tests.csproj
DevOps-Project-41/app/tests/AiApi.Tests/MockLlmProviderTests.cs
DevOps-Project-41/app/tests/AiWorker.Tests/AiProviderContractTests.cs
DevOps-Project-41/app/tests/AiWorker.Tests/AiWorker.Tests.csproj

coderabbitai

Actionable comments posted: 16

Note

Due to the large number of review comments, Critical, Major severity comments were prioritized as inline comments.

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

DevOps-Project-41/k8s/base/networkpolicy.yaml (1)
1-69: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Postgres is still completely open inside the cluster.

This file isolates ai-api, ai-worker, and redis, but nothing selects pods with app.kubernetes.io/name: postgres. The StatefulSet in DevOps-Project-41/k8s/base/postgres-statefulset.yaml uses that label, so the database remains non-isolated and reachable from any pod unless another policy exists elsewhere.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/k8s/base/networkpolicy.yaml` around lines 1 - 69, Add a
NetworkPolicy that selects pods with label "app.kubernetes.io/name: postgres"
(e.g., name it postgres-netpol) and restricts ingress to only the pods that
actually need DB access: create spec.policyTypes: ["Ingress"], spec.ingress
allowing from podSelector(s) for the callers (for example matchLabels
app.kubernetes.io/name: ai-api, app.kubernetes.io/name: ai-worker or
app.kubernetes.io/part-of: ai-native-devsecops-platform) and allow port: 5432
TCP; do not leave an open-wide ingress rule so the Postgres StatefulSet pods are
isolated.

🟡 Minor comments (8)

DevOps-Project-41/docs/cleanup.md-60-61 (1)

60-61: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Harden artifact cleanup command to avoid path-splitting issues.

Line 60 uses whitespace-delimited xargs, which can break on unusual paths and is less safe than null-delimited cleanup.

Suggested patch

-find . -name "bin" -o -name "obj" | xargs rm -rf
+find . -type d \( -name "bin" -o -name "obj" \) -print0 | xargs -0 -r rm -rf

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/docs/cleanup.md` around lines 60 - 61, The cleanup command
using a pipe to xargs ("find . -name \"bin\" -o -name \"obj\" | xargs rm -rf")
is unsafe for paths with whitespace or special characters; update the command to
use a null-delimited approach (make find emit NULs and pass to xargs with the
NUL option) or switch to find's safe execution form (use find's exec variant to
remove matches directly), keeping the existing target patterns (bin/obj) and the
subsequent rm -f sbom-*.json trivy-*.sarif TestResults/*.trx behavior unchanged.

DevOps-Project-41/tests/smoke/smoke-test.sh-56-63 (1)

56-63: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Remove duplicate /ask call and assert status from the same request.

Line 60 creates a second job and ask_status is never validated. This makes the smoke test noisier and less deterministic.

Suggested patch

-ask_response=$(curl -s -X POST "$API_URL/ask" \
-  -H "Content-Type: application/json" \
-  -d '{"prompt":"Explain GitOps in simple terms","model":"mock-devops-model"}')
-
-ask_status=$(curl -s -o /dev/null -w "%{http_code}" -X POST "$API_URL/ask" \
-  -H "Content-Type: application/json" \
-  -d '{"prompt":"second job","model":"mock-devops-model"}')
+ask_raw=$(curl -s -w $'\n%{http_code}' -X POST "$API_URL/ask" \
+  -H "Content-Type: application/json" \
+  -d '{"prompt":"Explain GitOps in simple terms","model":"mock-devops-model"}')
+ask_response=$(echo "$ask_raw" | sed '$d')
+ask_status=$(echo "$ask_raw" | tail -n1)
+assert_eq "POST /ask returns 202" "202" "$ask_status"

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/tests/smoke/smoke-test.sh` around lines 56 - 63, The script
makes two POSTs to the /ask endpoint (producing ask_response and ask_status) but
never validates ask_status; remove the duplicate POST that sets ask_status and
instead capture the HTTP status for the original request that populates
ask_response (e.g., use curl -w "%{http_code}" with -o/-s to store both body and
status from the single call). Update references to ask_status to use the
captured status from the same /ask request so the test asserts response code and
body deterministically.

DevOps-Project-41/docs/troubleshooting.md-9-11 (1)

9-11: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Add language identifiers to unlabeled fenced code blocks.

These fences trigger MD040 warnings. Use text (or bash where applicable) to keep markdownlint clean.
Suggested patch
-```
+```text
 Error response from daemon: Ports are not available: exposing port TCP 0.0.0.0:8080
- +text
redis.ai-devsecops.svc.cluster.local:6379
-```
+```text
Host=postgres;Database=aiops;Username=aiops;Password=aiops
- +text
Host=postgres.ai-devsecops.svc.cluster.local;Database=aiops;Username=aiops;Password=
Also applies to: 127-129, 188-190, 194-196
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/docs/troubleshooting.md` around lines 9 - 11, Several
fenced code blocks in troubleshooting.md are missing language identifiers (e.g.,
the blocks containing "Error response from daemon: Ports are not available:
exposing port TCP 0.0.0.0:8080", "redis.ai-devsecops.svc.cluster.local:6379",
"Host=postgres;Database=aiops;Username=aiops;Password=aiops", and
"Host=postgres.ai-devsecops.svc.cluster.local;Database=aiops;Username=aiops;Password=<from-secret>");
add an appropriate info string (use text or bash) after each opening triple
backtick so markdownlint MD040 is satisfied and the examples remain unchanged.

DevOps-Project-41/docs/observability.md-5-12 (1)

5-12: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Add a language to the first fenced block.

This block is unlabeled and trips MD040; use text for the architecture sketch.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/docs/observability.md` around lines 5 - 12, The fenced code
block containing the architecture sketch starting with "Application (ai-api,
ai-worker)" is unlabeled and triggers MD040; add the language tag `text` to the
opening fence (i.e., change the triple-backtick that begins the block to
```text) so the block is explicitly identified as plain text.

DevOps-Project-41/architecture/architecture.md-47-55 (1)

47-55: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Label fenced diagrams/flows with a language.

These blocks are currently unlabeled and trigger MD040. text is sufficient for all three.

Also applies to: 71-92, 96-100

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/architecture/architecture.md` around lines 47 - 55, The
fenced diagram blocks (e.g., the block starting with "Application SDK
(OpenTelemetry) → OpenTelemetry Collector (OTLP gRPC) ..." and the other blocks
around lines 71-92 and 96-100) are unlabeled and trigger MD040; fix by changing
the opening triple backticks from ``` to ```text so each diagram/flow code fence
is explicitly labeled with the language "text".

DevOps-Project-41/.github/workflows/ci.yml-9-13 (1)

9-13: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Include workflow file in PR path filters.

pull_request.paths currently misses .github/workflows/ci.yml, so CI changes in PRs can bypass this workflow.

Suggested patch

   pull_request:
     branches: [devops-project, master]
     paths:
       - "DevOps-Project-41/app/**"
+      - ".github/workflows/ci.yml"

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/.github/workflows/ci.yml` around lines 9 - 13, The
pull_request.paths filter in the GitHub Actions workflow omits the workflow file
itself (so changes to .github/workflows/ci.yml won't trigger the workflow);
update the pull_request.paths array in the ci.yml to include the workflow file
(for example add ".github/workflows/ci.yml" or broaden to ".github/**") so that
edits to the workflow are covered by the pull_request trigger, ensuring the
change is made in the pull_request.paths block of the
DevOps-Project-41/.github/workflows/ci.yml file.

DevOps-Project-41/docs/gitops.md-5-12 (1)

5-12: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Add a language to the fenced code block.

This block is missing a fence language and triggers MD040.
Suggested fix
-```
+```text
 Git commit
   → GitHub Actions CI (build + test + scan)
   → Release pipeline (push to GHCR + sign)
   → Pipeline updates image digest in k8s/overlays/dev/kustomization.yaml
   → Argo CD detects change in Git
   → Argo CD syncs desired state to Kubernetes cluster
</details>

<details>
<summary>🤖 Prompt for AI Agents</summary>
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In @DevOps-Project-41/docs/gitops.md around lines 5 - 12, The fenced code block
in docs/gitops.md lacks a language tag (causing MD040); update the
triple-backtick fence to include a language identifier (e.g., change ``` to
linter to confirm the warning is resolved.

DevOps-Project-41/docs/gitops.md-39-43 (1)

39-43: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Use a single path convention for the dev app manifest.

Line 39 and Line 42 use different paths for the same file, which can confuse runbooks.

Suggested fix

-kubectl apply -f DevOps-Project-41/gitops/argocd-app-dev.yaml
+kubectl apply -f gitops/argocd-app-dev.yaml

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/docs/gitops.md` around lines 39 - 43, The doc uses two
different paths for the same manifest
(DevOps-Project-41/gitops/argocd-app-dev.yaml vs gitops/argocd-app-dev.yaml)
which can confuse users; update the instructions in the gitops.md lines
referencing the dev app so they use a single consistent path convention (choose
either repository-root-prefixed "DevOps-Project-41/gitops/argocd-app-dev.yaml"
or the relative "gitops/argocd-app-dev.yaml") and apply that same path in both
the kubectl apply and the surrounding prose where `argocd-app-dev.yaml` is
mentioned (references: the manifest name argocd-app-dev.yaml and the kubectl
commands shown).

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@DevOps-Project-41/.github/workflows/release.yml`:
- Around line 7-12: The workflow declares workflow_dispatch.inputs.tag but never
uses it when tagging/pushing images; update the job to read
github.event.inputs.tag (e.g., assign it to an env var like IMAGE_TAG = ${{
github.event.inputs.tag }}) and replace any hardcoded or default tag usage in
the build/push steps (references to docker/build-push-action or any "push" steps
that currently use "latest" or a literal tag) to use that IMAGE_TAG; apply the
same change to the other push steps mentioned (the block around lines 77-90) so
all image push actions reference github.event.inputs.tag consistently.

In `@DevOps-Project-41/.github/workflows/security.yml`:
- Around line 68-110: The workflow only builds and scans the api target; add
equivalent steps to build and scan the worker target within the trivy-image job:
duplicate the "Build API image for scanning" step but set target: worker and
tags: ai-worker:scan, then duplicate the "Scan API image with Trivy" step but
set image-ref: ai-worker:scan and output: trivy-image-worker.sarif and adjust
the upload step to use category: trivy-image-worker (you can keep the existing
final upload-artifact step as it already collects "*.sarif"). Ensure step names
match the new purpose so it's easy to find (e.g., "Build worker image for
scanning", "Scan worker image with Trivy") and reuse the same context/file/cache
settings as the api build.
- Around line 6-9: The security workflow's paths filter currently only triggers
on "DevOps-Project-41/app/**", "DevOps-Project-41/k8s/**", and the workflow file
itself; update the paths section in .github/workflows/security.yml to also
include the observability and security directories so changes to the new
collector and Kyverno policies trigger the scan (add patterns such as
"DevOps-Project-41/observability/**" and "DevOps-Project-41/security/**" or
broaden to "DevOps-Project-41/**" if appropriate), then commit the updated paths
block so the workflow runs on those changes.

In `@DevOps-Project-41/gitops/argocd-app-prod.yaml`:
- Around line 14-15: Update the Argo CD Git source settings: replace the
placeholder owner in repoURL (the value of repoURL) with the real GitHub
organization/user that owns the repo, and set targetRevision to "master" instead
of "main" so the Argo CD Application tracks the correct production branch
(modify the repoURL and targetRevision fields accordingly).

In `@DevOps-Project-41/k8s/base/api-deployment.yaml`:
- Around line 35-36: Replace the mutable image reference
"ghcr.io/GITHUB_REPOSITORY/ai-api:latest" with a concrete, immutable image
(either a released semver tag or an image digest like
"ghcr.io/.../ai-api@sha256:<digest>") so deployments are reproducible; update
the Deployment's container image field (the image value in the container spec)
and ensure your CI/CD pipeline injects the specific tag or digest into that
manifest (or use kustomize/helm templating) instead of leaving the
placeholder-style repository token and :latest.

In `@DevOps-Project-41/k8s/base/networkpolicy.yaml`:
- Around line 16-23: The egress rules in networkpolicy.yaml currently only list
ports (under the egress block) and lack any destination (`to`) selectors, which
allows ai-api and ai-worker to reach any IP on those ports; update the egress
rules to constrain destinations by splitting them into: in-cluster
destination-scoped rules (use podSelector/namespaceSelector or ipBlock for known
services like Redis, PostgreSQL, DNS, OTLP collectors) that target the specific
service endpoints for ports 6379, 5432, 4317, 4318, 53, and a separate explicit
external rule that only allows HTTPS (port 443) to a limited set of IP ranges or
FQDNs if required; modify the egress entries referenced in the
networkpolicy.yaml egress stanza so each port entry includes an appropriate `to`
clause (podSelector/namespaceSelector/ipBlock) to enforce least privilege for
ai-api and ai-worker.

In `@DevOps-Project-41/k8s/overlays/dev/kustomization.yaml`:
- Around line 23-31: Remove the plaintext literals from the dev kustomization
secret generators (the resources named "postgres-secret" and
"ai-platform-secrets") and replace them with references to external secret
sources (e.g., secretGenerator pointing to files or an env-file, or use
ExternalSecret/ sealed secret integration) so no credentials are committed;
update the kustomization to read values from a non-committed path instead of
embedding POSTGRES credentials and keep disableNameSuffixHash behavior if
needed; also add a .secrets/ entry to .gitignore and add corresponding .example
files (e.g., postgres.env.example) that document the required variables for
local/dev setup.

In `@DevOps-Project-41/observability/loki-values.yaml`:
- Around line 27-31: Remove traceId from the promtail labels block so it is not
indexed: in the promtail config where you currently have a labels: stanza
containing traceId and level, delete the traceId entry from that labels block
and keep only level; ensure traceId is still parsed by the json/pipeline stage
(so it remains available in the log message/metadata for querying but is not
promoted to a high-cardinality Loki label). Target the promtail "labels:" block
and the JSON extraction that produces "traceId"/"level" when making the change.

In `@DevOps-Project-41/observability/otel-collector.yaml`:
- Around line 79-81: The container spec for the otel-collector currently uses
the Kubernetes "command" field which overrides the image entrypoint and causes
the flag string to be executed as a binary; replace the "command" field with
"args" for the otel-collector container so the image entrypoint stays intact and
the flag "--config=/etc/otel/otel-collector-config.yaml" is passed as an
argument to the collector process.

In `@DevOps-Project-41/observability/prometheus-values.yaml`:
- Line 28: Remove the hardcoded adminPassword value and instead read the Grafana
admin password from a Kubernetes Secret or external secret manager; replace the
literal adminPassword: admin with a reference that sources the secret (e.g., use
a values key that points to a Secret name/key or a Helm lookup/secretKeyRef) and
document the required Secret name and key so deployments pull the credential at
runtime rather than storing it in git.

In `@DevOps-Project-41/security/policies/deny-privileged-containers.yaml`:
- Around line 23-27: The deny rule currently only checks
spec.containers[*].securityContext.privileged, so privileged initContainers or
ephemeralContainers can bypass it; update the policy pattern to include
equivalent checks for spec.initContainers[*].securityContext.privileged and
spec.ephemeralContainers[*].securityContext.privileged (or refactor to iterate
over all container lists) so that privileged: "true"/absent privileged cannot be
set in initContainers or ephemeralContainers in addition to spec.containers.

In `@DevOps-Project-41/security/policies/require-non-root.yaml`:
- Around line 23-30: The current anyPattern uses OR semantics so pod-level
spec.securityContext.runAsNonRoot: true can be used to bypass container
overrides and it also omits init/ephemeral containers; update the policy to
assert runAsNonRoot: true for every container type by (1) replacing the loose
spec.containers entry with list-anchored checks that require
securityContext.runAsNonRoot: true for all elements (use the list
existential/for-all anchor pattern such as ^ or the policy engine's "forAll"
list matcher) so a single container can't bypass the rule, (2) add equivalent
list-anchored checks for spec.initContainers and spec.ephemeralContainers, and
(3) keep/validate pod-level spec.securityContext.runAsNonRoot: true only as an
additional guard but not as a sole OR branch; reference anyPattern,
spec.securityContext.runAsNonRoot, spec.containers, spec.initContainers, and
spec.ephemeralContainers when making these changes.

In `@DevOps-Project-41/security/policies/require-resource-limits.yaml`:
- Around line 23-32: The policy currently only enforces resource requests/limits
under spec.containers; update the pattern to also require the same requests and
limits for spec.initContainers so init containers cannot omit CPU/memory. Modify
the pattern block that contains spec.containers to include an equivalent entry
for spec.initContainers (mirroring the resources.requests and resources.limits
checks for memory and cpu) or use a shared schema definition applied to both
"containers" and "initContainers" so both keys are validated for
requests.memory, requests.cpu, limits.memory, and limits.cpu.

In `@DevOps-Project-41/security/policies/restrict-latest-tag.yaml`:
- Around line 24-26: The Kyverno rule "no-latest-tag" currently only validates
spec.containers[].image which lets initContainers and ephemeralContainers bypass
the check and doesn't reliably prevent implicit untagged (implicit :latest)
images; update the policy to (1) apply the same image constraint to
images.containers, images.initContainers and images.ephemeralContainers (or
include spec.initContainers[].image and spec.ephemeralContainers[].image) and
(2) replace or augment the pattern check with a verifyImages block using
verifyDigest: true or a CEL/images-variable expression that enforces that every
image has either a non-latest tag or a digest (for example by validating image
contains "@" or matches a non-latest tag regex) so the rule enforces tag/digest
constraints across all container types.

In `@DevOps-Project-41/tests/load/k6-ai-jobs.js`:
- Around line 71-72: The code calls jobRes.json("status") without checking
jobRes.status, which will throw on non-200 or non-JSON responses; update the
polling logic around the http.get(`${BASE_URL}/jobs/${jobId}`) call to first
inspect jobRes.status (and optionally jobRes.headers["content-type"]) and only
call jobRes.json("status") when status === 200 and the body is JSON, otherwise
treat the poll as a failed/timeout attempt (e.g., set status to a
failure/timeout sentinel, record failure, and continue/retry); reference the
jobRes variable, the http.get(...) call, jobId and BASE_URL when locating the
change.
- Around line 74-82: The success metric is incremented before the result is
validated; move the successRate.add(true) so it only runs when the completed
response actually has a non-null result. In the block handling status ===
"completed" (see successRate.add, jobsCompleted.add, jobDuration.add, and the
check(jobRes, {"completed job has result": ...})), perform the check first (or
evaluate r.json("result") into a local variable) and then call
successRate.add(true) only if that validation passes; keep jobsCompleted.add(1)
and jobDuration.add(...) as-is.

---

Outside diff comments:
In `@DevOps-Project-41/k8s/base/networkpolicy.yaml`:
- Around line 1-69: Add a NetworkPolicy that selects pods with label
"app.kubernetes.io/name: postgres" (e.g., name it postgres-netpol) and restricts
ingress to only the pods that actually need DB access: create spec.policyTypes:
["Ingress"], spec.ingress allowing from podSelector(s) for the callers (for
example matchLabels app.kubernetes.io/name: ai-api, app.kubernetes.io/name:
ai-worker or app.kubernetes.io/part-of: ai-native-devsecops-platform) and allow
port: 5432 TCP; do not leave an open-wide ingress rule so the Postgres
StatefulSet pods are isolated.

---

Minor comments:
In `@DevOps-Project-41/.github/workflows/ci.yml`:
- Around line 9-13: The pull_request.paths filter in the GitHub Actions workflow
omits the workflow file itself (so changes to .github/workflows/ci.yml won't
trigger the workflow); update the pull_request.paths array in the ci.yml to
include the workflow file (for example add ".github/workflows/ci.yml" or broaden
to ".github/**") so that edits to the workflow are covered by the pull_request
trigger, ensuring the change is made in the pull_request.paths block of the
DevOps-Project-41/.github/workflows/ci.yml file.

In `@DevOps-Project-41/architecture/architecture.md`:
- Around line 47-55: The fenced diagram blocks (e.g., the block starting with
"Application SDK (OpenTelemetry) → OpenTelemetry Collector (OTLP gRPC) ..." and
the other blocks around lines 71-92 and 96-100) are unlabeled and trigger MD040;
fix by changing the opening triple backticks from ``` to ```text so each
diagram/flow code fence is explicitly labeled with the language "text".

In `@DevOps-Project-41/docs/cleanup.md`:
- Around line 60-61: The cleanup command using a pipe to xargs ("find . -name
\"bin\" -o -name \"obj\" | xargs rm -rf") is unsafe for paths with whitespace or
special characters; update the command to use a null-delimited approach (make
find emit NULs and pass to xargs with the NUL option) or switch to find's safe
execution form (use find's exec variant to remove matches directly), keeping the
existing target patterns (bin/obj) and the subsequent rm -f sbom-*.json
trivy-*.sarif TestResults/*.trx behavior unchanged.

In `@DevOps-Project-41/docs/gitops.md`:
- Around line 5-12: The fenced code block in docs/gitops.md lacks a language tag
(causing MD040); update the triple-backtick fence to include a language
identifier (e.g., change ``` to ```text) so the block becomes annotated (use
"text" as suggested) and re-run the linter to confirm the warning is resolved.
- Around line 39-43: The doc uses two different paths for the same manifest
(DevOps-Project-41/gitops/argocd-app-dev.yaml vs gitops/argocd-app-dev.yaml)
which can confuse users; update the instructions in the gitops.md lines
referencing the dev app so they use a single consistent path convention (choose
either repository-root-prefixed "DevOps-Project-41/gitops/argocd-app-dev.yaml"
or the relative "gitops/argocd-app-dev.yaml") and apply that same path in both
the kubectl apply and the surrounding prose where `argocd-app-dev.yaml` is
mentioned (references: the manifest name argocd-app-dev.yaml and the kubectl
commands shown).

In `@DevOps-Project-41/docs/observability.md`:
- Around line 5-12: The fenced code block containing the architecture sketch
starting with "Application (ai-api, ai-worker)" is unlabeled and triggers MD040;
add the language tag `text` to the opening fence (i.e., change the
triple-backtick that begins the block to ```text) so the block is explicitly
identified as plain text.

In `@DevOps-Project-41/docs/troubleshooting.md`:
- Around line 9-11: Several fenced code blocks in troubleshooting.md are missing
language identifiers (e.g., the blocks containing "Error response from daemon:
Ports are not available: exposing port TCP 0.0.0.0:8080",
"redis.ai-devsecops.svc.cluster.local:6379",
"Host=postgres;Database=aiops;Username=aiops;Password=aiops", and
"Host=postgres.ai-devsecops.svc.cluster.local;Database=aiops;Username=aiops;Password=<from-secret>");
add an appropriate info string (use text or bash) after each opening triple
backtick so markdownlint MD040 is satisfied and the examples remain unchanged.

In `@DevOps-Project-41/tests/smoke/smoke-test.sh`:
- Around line 56-63: The script makes two POSTs to the /ask endpoint (producing
ask_response and ask_status) but never validates ask_status; remove the
duplicate POST that sets ask_status and instead capture the HTTP status for the
original request that populates ask_response (e.g., use curl -w "%{http_code}"
with -o/-s to store both body and status from the single call). Update
references to ask_status to use the captured status from the same /ask request
so the test asserts response code and body deterministically.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: ff97c3e6-8259-43c9-8b0e-944666f55505

📥 Commits

Reviewing files that changed from the base of the PR and between 1256980 and 35b4cdb.

⛔ Files ignored due to path filters (2)

DevOps-Project-41/architecture/diagrams/diagrams-1.png is excluded by !**/*.png
DevOps-Project-41/architecture/diagrams/diagrams-2.png is excluded by !**/*.png

📒 Files selected for processing (55)

DevOps-Project-41/.github/workflows/ci.yml
DevOps-Project-41/.github/workflows/release.yml
DevOps-Project-41/.github/workflows/security.yml
DevOps-Project-41/.gitignore
DevOps-Project-41/README.md
DevOps-Project-41/architecture/architecture.md
DevOps-Project-41/architecture/diagrams/architecture.mmd
DevOps-Project-41/docs/cleanup.md
DevOps-Project-41/docs/gitops.md
DevOps-Project-41/docs/observability.md
DevOps-Project-41/docs/security.md
DevOps-Project-41/docs/setup-cloud.md
DevOps-Project-41/docs/setup-local.md
DevOps-Project-41/docs/troubleshooting.md
DevOps-Project-41/docs/validation-checklist.md
DevOps-Project-41/gitops/argocd-app-dev.yaml
DevOps-Project-41/gitops/argocd-app-prod.yaml
DevOps-Project-41/infra/kind/kind-cluster.yaml
DevOps-Project-41/infra/terraform/README.md
DevOps-Project-41/k8s/base/api-deployment.yaml
DevOps-Project-41/k8s/base/api-service.yaml
DevOps-Project-41/k8s/base/configmap.yaml
DevOps-Project-41/k8s/base/keda-scaledobject-worker.yaml
DevOps-Project-41/k8s/base/kustomization.yaml
DevOps-Project-41/k8s/base/namespace.yaml
DevOps-Project-41/k8s/base/networkpolicy.yaml
DevOps-Project-41/k8s/base/postgres-statefulset.yaml
DevOps-Project-41/k8s/base/redis-deployment.yaml
DevOps-Project-41/k8s/base/secret.example.yaml
DevOps-Project-41/k8s/base/serviceaccount.yaml
DevOps-Project-41/k8s/base/worker-deployment.yaml
DevOps-Project-41/k8s/base/worker-service.yaml
DevOps-Project-41/k8s/overlays/dev/api-deployment-patch.yaml
DevOps-Project-41/k8s/overlays/dev/kustomization.yaml
DevOps-Project-41/k8s/overlays/dev/worker-deployment-patch.yaml
DevOps-Project-41/k8s/overlays/prod/api-deployment-patch.yaml
DevOps-Project-41/k8s/overlays/prod/kustomization.yaml
DevOps-Project-41/k8s/overlays/prod/worker-deployment-patch.yaml
DevOps-Project-41/observability/grafana-dashboard.json
DevOps-Project-41/observability/loki-values.yaml
DevOps-Project-41/observability/otel-collector-local.yaml
DevOps-Project-41/observability/otel-collector.yaml
DevOps-Project-41/observability/prometheus-local.yml
DevOps-Project-41/observability/prometheus-values.yaml
DevOps-Project-41/security/cosign.md
DevOps-Project-41/security/policies/deny-privileged-containers.yaml
DevOps-Project-41/security/policies/require-labels.yaml
DevOps-Project-41/security/policies/require-non-root.yaml
DevOps-Project-41/security/policies/require-resource-limits.yaml
DevOps-Project-41/security/policies/restrict-latest-tag.yaml
DevOps-Project-41/security/sbom.md
DevOps-Project-41/security/trivy.yaml
DevOps-Project-41/tests/load/k6-ai-jobs.js
DevOps-Project-41/tests/smoke/smoke-test.sh
README.md

✅ Files skipped from review due to trivial changes (19)

DevOps-Project-41/.gitignore
DevOps-Project-41/k8s/base/namespace.yaml
DevOps-Project-41/k8s/base/secret.example.yaml
DevOps-Project-41/k8s/base/configmap.yaml
DevOps-Project-41/security/policies/require-labels.yaml
DevOps-Project-41/infra/terraform/README.md
DevOps-Project-41/k8s/base/api-service.yaml
DevOps-Project-41/docs/validation-checklist.md
DevOps-Project-41/k8s/base/serviceaccount.yaml
DevOps-Project-41/gitops/argocd-app-dev.yaml
DevOps-Project-41/security/sbom.md
DevOps-Project-41/k8s/base/worker-deployment.yaml
DevOps-Project-41/README.md
README.md
DevOps-Project-41/security/cosign.md
DevOps-Project-41/docs/setup-local.md
DevOps-Project-41/architecture/diagrams/architecture.mmd
DevOps-Project-41/observability/grafana-dashboard.json
DevOps-Project-41/docs/setup-cloud.md

coderabbitai

♻️ Duplicate comments (3)

DevOps-Project-41/app/src/AiWorker/JobUpdater.cs (1)
11-13: ⚠️ Potential issue | 🟠 Major

Remove hardcoded database credentials fallback.

The hardcoded connection string "Host=localhost;Database=aiops;Username=aiops;Password=aiops" at line 13 remains a security risk and can route jobs to an unexpected database. Replace with an exception to fail fast when configuration is missing.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/app/src/AiWorker/JobUpdater.cs` around lines 11 - 13, The
constructor currently assigns _connectionString using
config.GetConnectionString("Postgres") ?? config["POSTGRES_CONNECTION_STRING"]
?? "Host=localhost;Database=aiops;Username=aiops;Password=aiops"; remove the
hardcoded fallback and instead throw a clear exception when neither config
source is present so the app fails fast; update the JobUpdater constructor to
validate that _connectionString is non-empty after checking
GetConnectionString("Postgres") and config["POSTGRES_CONNECTION_STRING"] and
throw (for example) an InvalidOperationException or ArgumentException with a
descriptive message if missing.
.github/workflows/project-41-ci.yml (2)
59-60: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Disable persisted checkout credentials on second checkout.

The second checkout step at line 60 still uses actions/checkout@v4 without persist-credentials: false, leaving the auth token configured in the runner's git settings. Apply the same fix used for the first checkout.
🔒 Proposed fix
       - name: Checkout
         uses: actions/checkout@v4
+        with:
+          persist-credentials: false
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In @.github/workflows/project-41-ci.yml around lines 59 - 60, The second GitHub
Actions checkout step still uses actions/checkout@v4 without disabling persisted
credentials; update that checkout step (the second "Checkout" step that calls
actions/checkout@v4) to include persist-credentials: false so the runner's git
auth token is not left configured after the checkout, matching the first
checkout step's configuration.
27-27: ⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

Pin all GitHub Actions to full commit SHAs.

All actions in the workflow use floating tags (@v4, @v3, @v6) instead of immutable commit SHAs:

actions/checkout@v4 (lines 27, 60)

actions/setup-dotnet@v4 (line 32)

actions/upload-artifact@v4 (line 50)

docker/setup-buildx-action@v3 (line 63)

docker/build-push-action@v6 (lines 66, 78)

Replace each tag with the corresponding full commit SHA for supply chain security.

Also applies to: 32-32, 50-50, 60-60, 63-63, 66-66, 78-78
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In @.github/workflows/project-41-ci.yml at line 27, The workflow uses floating
tags for third-party actions (actions/checkout@v4, actions/setup-dotnet@v4,
actions/upload-artifact@v4, docker/setup-buildx-action@v3,
docker/build-push-action@v6) which should be replaced with immutable full commit
SHAs; update each usage in the YAML to point to the corresponding commit SHA
(find the exact SHA on each action's GitHub releases/tags page or by inspecting
the tag ref) and ensure all occurrences (lines referencing actions/checkout,
actions/setup-dotnet, actions/upload-artifact, docker/setup-buildx-action,
docker/build-push-action) are replaced so the workflow pins to those SHAs.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Duplicate comments:
In @.github/workflows/project-41-ci.yml:
- Around line 59-60: The second GitHub Actions checkout step still uses
actions/checkout@v4 without disabling persisted credentials; update that
checkout step (the second "Checkout" step that calls actions/checkout@v4) to
include persist-credentials: false so the runner's git auth token is not left
configured after the checkout, matching the first checkout step's configuration.
- Line 27: The workflow uses floating tags for third-party actions
(actions/checkout@v4, actions/setup-dotnet@v4, actions/upload-artifact@v4,
docker/setup-buildx-action@v3, docker/build-push-action@v6) which should be
replaced with immutable full commit SHAs; update each usage in the YAML to point
to the corresponding commit SHA (find the exact SHA on each action's GitHub
releases/tags page or by inspecting the tag ref) and ensure all occurrences
(lines referencing actions/checkout, actions/setup-dotnet,
actions/upload-artifact, docker/setup-buildx-action, docker/build-push-action)
are replaced so the workflow pins to those SHAs.

In `@DevOps-Project-41/app/src/AiWorker/JobUpdater.cs`:
- Around line 11-13: The constructor currently assigns _connectionString using
config.GetConnectionString("Postgres") ?? config["POSTGRES_CONNECTION_STRING"]
?? "Host=localhost;Database=aiops;Username=aiops;Password=aiops"; remove the
hardcoded fallback and instead throw a clear exception when neither config
source is present so the app fails fast; update the JobUpdater constructor to
validate that _connectionString is non-empty after checking
GetConnectionString("Postgres") and config["POSTGRES_CONNECTION_STRING"] and
throw (for example) an InvalidOperationException or ArgumentException with a
descriptive message if missing.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 669f22ce-3f22-49be-aa19-317c7d8ad654

📥 Commits

Reviewing files that changed from the base of the PR and between 35b4cdb and 382b6b0.

📒 Files selected for processing (2)

.github/workflows/project-41-ci.yml
DevOps-Project-41/app/src/AiWorker/JobUpdater.cs

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (1)

DevOps-Project-41/app/src/AiWorker/JobUpdater.cs (1)

23-24: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Affected rows not checked — previous fix not applied.

Despite being marked as addressed in a prior review, the code still ignores the ExecuteNonQueryAsync() return value. If job_id is invalid or missing, this method silently succeeds, causing state drift between the worker's belief and actual database state.

Proposed fix

-        await cmd.ExecuteNonQueryAsync();
+        var affected = await cmd.ExecuteNonQueryAsync();
+        if (affected != 1)
+            throw new InvalidOperationException($"MarkProcessing failed for job_id={jobId}. Rows affected: {affected}");

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@DevOps-Project-41/app/src/AiWorker/JobUpdater.cs` around lines 23 - 24, The
call to await cmd.ExecuteNonQueryAsync() in JobUpdater.cs is ignoring the
returned affected-row count; change it to capture the result (e.g., int affected
= await cmd.ExecuteNonQueryAsync()), then check affected > 0 and handle the
failure case (throw an exception or log and return an error) so that when job_id
is invalid or missing the code does not silently proceed; update any surrounding
method (the JobUpdater method that executes the command) to propagate or handle
this error path accordingly.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@DevOps-Project-41/app/src/AiWorker/JobUpdater.cs`:
- Around line 41-42: In MarkCompletedAsync and MarkFailedAsync, capture the
result of await cmd.ExecuteNonQueryAsync() into an int (e.g., affected) and
validate it; if affected == 0 then throw or surface an exception (for example
InvalidOperationException) that includes the job identifier (jobId) and
operation (completed/failed) so a silent success is avoided—update both methods
(references: MarkCompletedAsync, MarkFailedAsync, ExecuteNonQueryAsync, cmd) to
perform this check and return/continue only when affected > 0.

---

Duplicate comments:
In `@DevOps-Project-41/app/src/AiWorker/JobUpdater.cs`:
- Around line 23-24: The call to await cmd.ExecuteNonQueryAsync() in
JobUpdater.cs is ignoring the returned affected-row count; change it to capture
the result (e.g., int affected = await cmd.ExecuteNonQueryAsync()), then check
affected > 0 and handle the failure case (throw an exception or log and return
an error) so that when job_id is invalid or missing the code does not silently
proceed; update any surrounding method (the JobUpdater method that executes the
command) to propagate or handle this error path accordingly.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 469f5501-65d4-445d-9661-538a3399f163

📥 Commits

Reviewing files that changed from the base of the PR and between 382b6b0 and 4144428.

📒 Files selected for processing (1)

DevOps-Project-41/app/src/AiWorker/JobUpdater.cs

dotnet core app

67077dd

oleitao added 4 commits May 22, 2026 23:27

github

1256980

github workflow

a15d52d

README.md

49d5b2c

end-to-end AI devsecops with gitops delivery flow

35b4cdb

coderabbitai Bot reviewed May 22, 2026

View reviewed changes

fix

382b6b0

coderabbitai Bot reviewed May 22, 2026

View reviewed changes

oleitao added 2 commits May 23, 2026 22:14

revert fix

4144428

fix

e2e0b5b

coderabbitai Bot reviewed May 23, 2026

View reviewed changes

Comment thread DevOps-Project-41/app/src/AiWorker/JobUpdater.cs

Uh oh!

Conversation

oleitao commented May 22, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

oleitao commented May 22, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 22, 2026 •

edited

Loading