Sunday, October 19, 2025

The Missing Half of Digital Delivery: A Quality Pipeline for the Platform

Most teams have an Application Pipeline. Few have a Quality Pipeline: one that keeps the platform (application servers, security providers, and business databases) synchronized and identical across Dev, QA, and Prod. That gap is why the same release behaves differently in each environment.

What is the Quality Pipeline?

A separate, always-on pipeline that versions, tests, and promotes the runtime itself:

  • Application servers & domains (WebLogic/SOA/BPM/WCC): versions, patches, server groups, datasources, JMS, logging, SSL, proxies.

  • Security providers: LDAP realms, authentication chains, roles, and mappings—promoted as versioned policy, not manual steps.

  • Business databases: schemas and reference data promoted as versioned changes (not ad-hoc scripts).

It’s not documentation. It’s a repeatable, auditable system that makes QA and Prod match the reference DEV every time.
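
To make “versioned platform” concrete, here is a minimal sketch in Python of the kind of data such a definition could capture. Every field name and value below is an illustrative assumption, not a real WDT or vendor schema:

    # Illustrative only: a platform version as one taggable artifact.
    # Real pipelines would express this as a WDT model, Ansible vars,
    # or similar; these fields and values are invented.
    from dataclasses import dataclass, field

    @dataclass(frozen=True)
    class PlatformDefinition:
        version: str                 # promoted as a unit, e.g. "platform-2025.10.1"
        app_server_patch_level: str  # exact WebLogic/SOA patch baseline
        security_providers: dict     # LDAP realms, auth chains, role mappings
        db_schema_version: str       # Liquibase/Flyway changeset tag
        parameters: dict = field(default_factory=dict)  # per-env: IPs, proxies, URLs

    GOLDEN = PlatformDefinition(
        version="platform-2025.10.1",
        app_server_patch_level="12.2.1.4.230101",
        security_providers={"realm": "corpLDAP", "auth_chain": ["LDAP", "Default"]},
        db_schema_version="db-2025.10.1",
    )

The point is that the platform becomes a single, taggable artifact rather than a pile of runbook steps.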

Why it must be separate from the App Pipeline

  • Different lifecycles. Platform patches and security baselines move on their own cadence; apps should not be blocked or destabilized by them.

  • Different owners & controls. Platform belongs to platform/infra/security; apps belong to product teams. Separation clarifies accountability and approvals.

  • Different quality gates. Platform gates are about safety and conformity (patch levels, cipher suites, access control); app gates are about features and behavior.

What the Quality Pipeline does (in plain language)

  • Freezes a “golden” platform in the reference DEV environment (the one source of truth).

  • Runs its own tests (health, connectivity, security, performance smoke) on the platform—not the app.

  • Promotes the exact platform version to QA/Prod with a click, keeping environment-specific items (IPs, proxies, credentials) safely parameterized (a sketch follows this list).

  • Generates a compliance report (what changed, who approved, what passed), and can roll back instantly if a gate fails.
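
As a rough illustration of that promote-test-report loop, here is a self-contained Python sketch. Every function is a stub standing in for real wiring (Artifactory fetch, Vault parameters, WDT/Ansible apply, a test runner); none of these names come from an actual API:

    # Hypothetical one-click promotion gate; all functions are stubs.
    def fetch_signed_bundle(version: str) -> str:
        return f"artifacts/{version}.tar.gz"            # stub: pull from Nexus/Artifactory

    def render_parameters(env: str) -> dict:
        return {"env": env, "proxy": f"proxy.{env}.internal"}  # stub: Vault + env vars

    def apply_platform(bundle: str, params: dict) -> None:
        print(f"applying {bundle} to {params['env']}")  # stub: WDT / Ansible apply

    def run_health_checks(env: str) -> bool:
        return True                                     # stub: health/connectivity/security smoke

    def rollback(env: str) -> None:
        print(f"{env}: rolled back to last known-good")

    def promote(version: str, env: str) -> None:
        apply_platform(fetch_signed_bundle(version), render_parameters(env))
        if not run_health_checks(env):
            rollback(env)
            raise RuntimeError(f"{env}: platform gate failed")
        print(f"{env}: promoted {version}; compliance report generated")

    promote("platform-2025.10.1", "qa")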

What it synchronizes precisely

  • Application servers: version & patch level, domain configuration, clusters, ports, JDBC/JMS, logging/auditing, SSL/proxy settings.

  • Security providers: directory connections, authentication chains, roles/groups, policy mappings.

  • Business DB: schema changes and curated reference data (never live production data).

  • Never: runtime instance data (SOA audit, in-flight messages, JMS stores, transaction logs).

Business outcomes (why leaders should care)

  • Fewer incidents, faster releases. Identical platforms mean QA results predict Prod. Typical teams see 30–50% lower MTTR and far fewer change failures.

  • Lower risk & better audits. Every platform change is versioned, approved, and automatically tested.

  • Happier teams. No more “works on Dev, breaks on QA” firefights; fewer weekend rollbacks.

Replace document-driven procedures with a Quality Pipeline

Old way (fragile): long runbooks, manual patching, step drift, tribal knowledge.
New way (reliable): versioned platform definitions, automated promotion, built-in tests & rollback.

Topic                 | Document-driven          | Quality Pipeline (separate)
App servers & domains | Manual edits per env     | Versioned definition, promoted identically
Security providers    | Hand-tuned, inconsistent | Policy bundles, consistent across envs
Business DB           | Ad-hoc scripts           | Versioned changesets & reference data
Evidence              | Meeting minutes          | Automated report: what/when/who/tests
Rollback              | Best effort              | Single-click to last known-good

How to start (business-friendly, low risk)

  1. Nominate a reference DEV environment for the platform (servers, security, DB schema).

  2. Stand up the Quality Pipeline just for the platform—no app changes yet.

  3. Pilot one promotion (Dev → QA) of a platform patch and security update; measure incidents and time saved.

  4. Institutionalize: platform releases get a version/tag and move forward on their own cadence; apps depend on a declared minimum platform version (a minimal version check is sketched below).
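
As a minimal sketch of step 4, assuming a tag convention like “platform-2025.10.1” (invented here for illustration), the dependency check an app pipeline runs can be as small as:

    # Toy check: does the environment meet the app's declared minimum
    # platform version?  The tag format is an assumed convention.
    def parse(tag: str) -> tuple:
        return tuple(int(p) for p in tag.removeprefix("platform-").split("."))

    def meets_minimum(deployed: str, required: str) -> bool:
        return parse(deployed) >= parse(required)

    assert meets_minimum("platform-2025.10.1", "platform-2025.9.0")
    assert not meets_minimum("platform-2025.8.2", "platform-2025.9.0")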

Talking points for stakeholders

  • “We’ll stop treating servers, security, and schemas as paperwork and start treating them as products we can version, test, and promote.”

  • “A separate Quality Pipeline guarantees QA and Prod behave like the reference DEV: no surprises, no heroics.”

  • “Compliance improves because every change produces an automatic, signed report of exactly what shipped.”

Conclusion: The Application Pipeline gets features out the door. The Quality Pipeline makes sure every environment runs them the same way: safely, repeatably, and auditably. Keep them separate on purpose.

Tools that make the Quality Pipeline real (one-click, identical-but-parameterized)

  • Source of truth & CI/CD:
    Git (GitHub/GitLab/Bitbucket) for versioning every platform change; Jenkins / GitLab CI / GitHub Actions to run the one-click promotion (build → test → promote → report).

  • Artifact & image management:
    Nexus/Artifactory for storing signed artifacts (Golden Oracle Home tar/images, SAR/EAR, CMU bundles); optional Packer to bake a “Golden Oracle Home” image with exact patches.

  • Infrastructure & configuration as code:
    Terraform (provision VMs/network) + Ansible (OS prereqs, reverse proxies, templated configs).
    WebLogic Deploy Tooling (WDT) to define and promote domain configuration (clusters, JDBC/JMS, SSL, logging, work managers) with per-environment variables (IPs, proxies, ports).
    (Optional) WebLogic Kubernetes Operator if you standardize on Kubernetes later.

  • Application-server ecosystem automation:
    WLST/Ant or REST for deploying SOA/BPM packages; OPatch scripted for patch baselines; MDS exporters/importers versioned in Git.

  • WebCenter Content (WCC):
    CMU (Configuration Migration Utility) for exporting/importing WCC configuration bundles; Archiver/Replication for scoped content moves (not full prod copies).

  • Business database:
    Liquibase (or Flyway) to promote schema & reference data as versioned changesets with environment “contexts” (Dev/QA/Prod).

  • Secrets & certificates:
    Vault/KMS to inject passwords, keystores, and tokens at deploy time; no secrets in Git.

  • Quality & compliance gates:
    Trivy/Grype for SBOM & CVE scans of the Golden Home image; OPA/Conftest to enforce policy (e.g., TLS1.2+, no admin over HTTP); automated smoke & functional tests (Postman/SoapUI, Selenium/Playwright).

  • Observability & drift control:
    WLDF/ODL configs versioned in WDT; Elastic/EFK or Prometheus/Grafana for logs/metrics; WDT discoverDomain diffs and lsinventory checks to prove targets match the reference (a drift-check sketch follows this list).
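
As a hedged sketch of that drift check: discover the target domain’s model with WDT, then diff it against the versioned reference and fail the gate on any difference. The discoverDomain flags shown are approximate; verify them against your WDT release:

    # Sketch only: the WDT invocation is approximate, and the diff is a
    # plain text comparison standing in for WDT's own model-compare tooling.
    import difflib, pathlib, subprocess

    def discover(domain_home: str, out_model: str) -> None:
        subprocess.run(
            ["discoverDomain.sh", "-oracle_home", "/u01/oracle",
             "-domain_home", domain_home, "-model_file", out_model],
            check=True)

    def drift(reference_model: str, discovered_model: str) -> list:
        ref = pathlib.Path(reference_model).read_text().splitlines()
        got = pathlib.Path(discovered_model).read_text().splitlines()
        return list(difflib.unified_diff(ref, got, "reference", "target"))

    # Gate idea: a non-empty drift() means the target no longer matches
    # the reference DEV platform and the promotion should stop.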

Result: QA and Prod become identical where they must be identical (binaries, patch levels, domain config, security providers, logging/audit policies) while preserving environment-specific settings (IPs/hostnames, proxies, credentials, URLs) via variables and secrets. One click promotes the proven reference DEV platform forward, runs health checks and tests automatically, and produces an auditable report: fast, predictable, and reversible.

Tuesday, October 14, 2025

Evolution of Operational Maintenance: From Reactive to Predictive and Proactive Models

Many established companies are now questioning a long-standing imbalance in IT operations: too much money is spent on reactive activities, and not enough on preventive or proactive ones.

This discussion is not new, but it has gained tremendous importance in recent years as organizations realize that operational reactivity consumes valuable talent and prevents innovation.

1. The Core of the Discussion

In many organizations, IT departments and suppliers still operate under a reactive paradigm: waiting for incidents to occur, then mobilizing resources to fix them.

However, companies are increasingly recognizing that:

  • reactive work is costly,

  • it reduces operational resilience, and

  • it does not generate value, only damage control.

As a result, the conversation is shifting toward building preventive and predictive maintenance capabilities, where failures are avoided rather than simply repaired.

This topic has become one of the central pillars of modern IT operations management, deeply embedded in frameworks such as IT Service Management (ITSM), DevOps, AIOps, and Site Reliability Engineering (SRE).

2. Misaligned Incentives: Clients vs. Service Providers

One of the most controversial aspects of this transformation lies in the incentive structure between service providers and client organizations.

Service Providers

  • Often prefer reactive maintenance models because they are simpler and more profitable.

  • Incident-based billing (hourly or per ticket) creates a direct financial incentive to maintain a steady flow of issues rather than eliminate their root causes.

  • Reactive support requires less strategic investment in automation, predictive monitoring, or process redesign.

  • Contracts usually focus on SLA response times, not on measurable reduction of incidents or improvements in system resilience.

Client Organizations

  • Want the opposite: fewer failures, more stability, and more automation.

  • Understand that every unplanned outage or repeated issue has a hidden cost: production delays, lost productivity, compliance risks, and staff burnout.

  • View reactive maintenance as a symptom of operational immaturity, not an achievement.

This structural misalignment has become a recurring theme in executive IT committees, where CIOs and CTOs are asking hard questions about the real value of their outsourcing models.

3. How Companies Are Addressing It

Forward-looking organizations are starting to redefine their maintenance contracts, metrics, and cultural approach to operations.

Some of the most common shifts include:

Contract Redesign

  • Moving from “pay-per-incident” models to “pay-per-stability” or “continuous improvement” models.

  • Introducing KPIs for yearly reduction of critical incidents.

  • Adding bonus mechanisms for automation and self-healing deployments.

Governance and Process Audits

  • Integrating maturity assessments (ITIL, COBIT, Lean IT) that check whether vendors are truly performing Problem Management, not just Incident Management.

  • Requiring root-cause analysis documentation for recurring failures.

  • Establishing governance boards to review incident repetition patterns and enforce preventive actions.

Operational Transparency

  • Clients increasingly deploy their own observability platforms to monitor uptime, logs, and performance metrics directly.

  • This transparency limits the ability of providers to report selectively and empowers the client with data-driven accountability.

4. The Ethical and Strategic Dimension

A growing number of CIOs now articulate the dilemma in simple but powerful terms:

“If the service provider earns money every time something breaks, why would they want the system to be stable?”

This analogy mirrors the healthcare model: if doctors were paid only when patients were sick, prevention would never advance.
Hence the movement toward “value-based IT operations”, where success is defined not by the number of issues resolved but by the number of issues avoided.

From a strategic standpoint, this also touches upon:

  • Vendor dependency and the erosion of internal technical knowledge,

  • The difficulty of introducing automation in legacy outsourcing contracts, and

  • The need for shared accountability between client and provider.

5. The Emerging Shift: Toward Proactive, Predictive, and Autonomous Maintenance

Leading organizations (Airbus, CGI, Repsol, and major public administrations) are embracing a multi-stage evolution:

Maintenance Model | Description                 | Example                                     | Business Impact
Reactive          | Respond after a failure     | Restarting a crashed server                 | Restores service but no improvement
Preventive        | Scheduled maintenance       | Rotating logs, cleaning caches              | Reduces minor failures
Proactive         | Data-driven anticipation    | Detecting disk saturation trends            | Avoids major incidents
Predictive        | AI/ML anticipates failures  | ML model forecasts performance degradation  | Prevents critical outages
Autonomous        | Self-healing systems        | Kubernetes auto-restarts and scales         | High resilience, minimal human input

The goal is to progress along this maturity curve by combining data, automation, and AI into the operational core (a toy example of the proactive step is sketched below).
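
As a toy example of the “Proactive” row above, this Python sketch fits a linear trend to daily disk-usage samples and estimates days until saturation; the data and the 30-day threshold are invented for illustration:

    # Least-squares slope over daily samples of disk usage (% used).
    def days_until_full(samples, capacity=100.0):
        n = len(samples)
        mean_x, mean_y = (n - 1) / 2, sum(samples) / n
        num = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(samples))
        den = sum((x - mean_x) ** 2 for x in range(n))
        slope = num / den                # % per day
        if slope <= 0:
            return None                  # not trending toward saturation
        return (capacity - samples[-1]) / slope

    usage = [61.0, 62.4, 63.9, 65.1, 66.8]   # one sample per day
    eta = days_until_full(usage)
    if eta is not None and eta < 30:
        print(f"raise a preventive ticket: disk full in ~{eta:.0f} days")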

6. A Debate That Reaches the Boardroom

This topic is no longer confined to technical teams.
It is increasingly present in:

  • IT governance boards and Change Advisory Boards (CAB),

  • Digital transformation committees,

  • Outsourcing renegotiations, and

  • R&D programs focusing on AI, AIOps, and automation.

Executives see operational proactivity not just as a technical goal, but as a strategic enabler of innovation and cost efficiency.

7. Conclusion

The transition from reactive to predictive operations represents a cultural and economic turning point in the IT industry.

While service providers have traditionally benefited from reactive maintenance, the most mature organizations are shifting the narrative: measuring success by stability, resilience, and continuous improvement rather than by the number of incidents resolved.

This evolution is powered by:

  • Automation,

  • Artificial Intelligence, and

  • A shift in mindset: from firefighting to foresight.

Ultimately, the companies that embrace proactive and predictive maintenance models will not only spend less on operational chaos; they will also unlock the freedom to innovate faster, safer, and smarter.

Thursday, October 2, 2025

Closing the Gaps in Industry: From Bureaucracy to Data-Centered Agility

In manufacturing industries, one of today’s biggest challenges is closing the knowledge and data gaps between business management, procurement, production, and IT systems teams.

Traditionally, operational implementation has been driven by the type of information that needed to be managed, but with systems that are extremely costly to implement, difficult to maintain, data-redundant, and overly fragmented across applications. The result: bureaucracy, lack of continuity, and a loss of shared vision.

Today, replacing those core systems (OLTP, ERP, PLM, CRM…) is unfeasible. The real path forward is different: reusing what already exists, but in an agile, cost-effective, data-centric way.

A New Approach: From Systems to Data

Current trends show we need to give less importance to classical applications and more to ultra-fast layers of integration and analysis, where:

  • Data is the center of gravity.

  • AI is applied to operational management (not design or engineering).

  • The key is to close gaps through small, agile developments that deliver value quickly.

This enables:

  • Natural language queries.

  • Self-documentation systems.

  • Lightweight and flexible user interfaces.

  • Projects where technical teams and business experts collaborate seamlessly.

Case Study 1: Interface Documentation Assisted by AI

One of the most critical pain points in industry is the lack of clear and updated documentation of interfaces between systems.

  • The problem: Procurement, production, logistics, and quality systems exchange data through dozens of interfaces. Documentation is often outdated or missing, and knowledge sits in the heads of a few specialists.

  • The consequence: Excessive dependency, costly integration projects, and limited visibility for business leaders.

How AI can help

  1. Automatic inventory of APIs, logs, messages, and database links to generate an initial interface map.

  2. Dynamic documentation generation, where technical details are translated into business language (e.g. “The Procurement system sends the daily parts list to Production in JSON format”); a minimal sketch follows this list.

  3. Continuous updates so documentation evolves with each change.

  4. Natural language queries, e.g. “Which systems consume real-time production data?”, returning a clear diagram.
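
As a minimal sketch of step 2, assuming an interface inventory has already been extracted into records like the one below (the record format is invented; a real inventory would come from API gateways, logs, or database links), the translation into business language can start as simply as:

    # Illustrative only: render one inventory record as a business-language line.
    INTERFACES = [
        {"source": "Procurement", "target": "Production",
         "payload": "daily parts list", "format": "JSON", "schedule": "daily"},
    ]

    def describe(iface: dict) -> str:
        return (f"The {iface['source']} system sends the {iface['payload']} "
                f"to {iface['target']} in {iface['format']} format ({iface['schedule']}).")

    for iface in INTERFACES:
        print(describe(iface))

An LLM would replace the fixed template with richer phrasing, but the pipeline shape (inventory in, plain language out) stays the same.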

Benefits

  • Closes a critical gap between IT and business.

  • Reduces bureaucracy and dependency on individuals.

  • Accelerates decisions and new integration projects.

  • Builds the foundation of a more agile, data-driven enterprise.

Case Study 2: Git and Jira as a Knowledge System with RAG

Another strategic opportunity is applying generative AI with Retrieval-Augmented Generation (RAG) on top of Git and Jira.

How it works

  • Git: the system ingests code repositories, documentation, commit history, and recent changes.

  • Jira: it ingests issues, user stories, tasks, comments, attachments, and workflows.

  • The content is normalized, chunked into manageable fragments, and indexed using both semantic (vector-based) and keyword search.

  • A user can then ask in natural language:

    • “Which commit changed the login validation?”

    • “What issues are blocking the current sprint delivery?”

    • “Who last modified the payments module?”

  • The system retrieves the relevant fragments and feeds them to the generative AI, which produces a clear, contextualized answer with links back to Git commits or Jira tickets. A minimal sketch of this flow follows the list.
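
Here is that retrieve-then-generate loop as a self-contained Python toy. Word-overlap scoring stands in for the semantic-plus-keyword index described above, generate_answer is a placeholder for a real LLM call, and the corpus entries are invented:

    import re

    CORPUS = [
        {"id": "commit:9f3c2a1", "text": "Tighten login validation regex in auth module"},
        {"id": "PAY-482",        "text": "Jira: payments module fails for expired cards"},
        {"id": "commit:4b7d0e9", "text": "Refactor payments module retry logic"},
    ]

    def tokens(s: str) -> set:
        return set(re.findall(r"\w+", s.lower()))

    def retrieve(question: str, k: int = 2) -> list:
        q = tokens(question)
        ranked = sorted(CORPUS, key=lambda d: len(q & tokens(d["text"])), reverse=True)
        return ranked[:k]

    def generate_answer(question: str, fragments: list) -> str:
        context = "; ".join(f"[{f['id']}] {f['text']}" for f in fragments)
        return f"(an LLM would answer {question!r} from: {context})"  # placeholder

    q = "Which commit changed the login validation?"
    print(generate_answer(q, retrieve(q)))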

What is automated

  • Documentation: summaries of commits, issues, and project changes.

  • Traceability: linking code changes with the Jira tasks that motivated them (a tiny sketch follows this list).

  • Cross-search: a single point to query both Git and Jira without switching tools.

  • Smart notifications: alerts for dependencies, blockers, or critical changes.

  • Automated reporting: daily or weekly project status summaries.
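
The traceability item above can start with something as simple as extracting Jira issue keys from commit messages. The PROJ-123 pattern is the common Jira default; the commits are invented:

    import re

    KEY = re.compile(r"\b[A-Z][A-Z0-9]+-\d+\b")   # default Jira key shape

    commits = [
        ("4b7d0e9", "PAY-482 retry payments on transient gateway errors"),
        ("9f3c2a1", "tighten login validation (no ticket)"),
    ]

    for sha, message in commits:
        keys = KEY.findall(message)
        print(sha, "->", keys if keys else "unlinked")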

Benefits

  • Saves time: no need to manually search across repositories and projects.

  • Improves collaboration: both business and technical users can ask questions in plain language.

  • Reduces risk: better visibility of dependencies between code and tasks.

  • Keeps documentation alive: always up-to-date, without extra manual effort.

  • Faster decision-making: managers can ask “Which tasks are blocked by code dependencies?” and get immediate answers.

Conclusion

Digital transformation in industry is not about replacing legacy systems, but about closing the knowledge and data gaps with agile, data-driven, AI-assisted solutions.

The first step may be to improve interface documentation with AI.
The second step could be applying RAG on top of operational repositories like Git and Jira.

Both approaches empower teams to collaborate better, reduce bureaucracy, and unlock faster, more informed decision-making.

The industries that succeed in combining agility, data, and applied AI for operations will be the most competitive in the years ahead.