Wazuh Services Table Integration: A Comprehensive Guide
Introduction
Hey guys! We're diving into an exciting project: integrating a unified `services` table into Wazuh DB and Inventory Harvester. This is all about making system service data more accessible, consistent, and useful across platforms. Think of it as building a super-organized library for all your service information, making it easier to find exactly what you need, when you need it. This article covers the functional requirements, table structure, and tasks involved in the integration. By aligning with the Elastic Common Schema (ECS), we're making sure this data plays nicely with other tools and systems, giving you a holistic view of your environment.
Wazuh Version and Components
| Wazuh version | Component(s) | Install type | Install method | Platform |
|---|---|---|---|---|
| 4.14.0-alpha0 | Wazuh DB, InventoryHarvester | Manager | Packages/Sources | All |
We're focusing on Wazuh version 4.14.0-alpha0, specifically the Wazuh DB and Inventory Harvester components, which are crucial for managing and collecting system service data. The install type is Manager, and the installation method can be either Packages or Sources, across all supported platforms. So whether you're a seasoned Wazuh user or just starting out, this integration streamlines how you handle service data, and the compatibility overview above means you can implement these changes confidently, knowing they're designed to work with your setup.
Description
The main goal here is to support a unified, ECS-aligned schema for system services, implemented in both Wazuh DB and Inventory Harvester. This lets us handle, store, and index service inventory information across all supported platforms: system services or daemons installed or running on endpoints, normalized and easily queryable regardless of operating system. Think about how much easier it will be to track and manage your services when everything is in a consistent format! This unified approach means you won't have to juggle different data formats or structures. We're essentially creating a single source of truth for service information, which is powerful for troubleshooting, compliance, and overall system health monitoring. So get ready to say goodbye to data silos and hello to a streamlined, unified view of your system services!
Functional Requirements
Let's break down the functional requirements for this project:
Implement a unified `services` table aligned with ECS
First off, we need to implement a unified `services` table aligned with the Elastic Common Schema (ECS). ECS provides a common set of fields for data, which makes it easier to analyze data from different sources. This is a big deal because it means all your service data will be standardized, making it a breeze to search, filter, and correlate. Imagine trying to compare apples and oranges: that's what it's like dealing with non-standardized data. This standardized table will be the backbone of our service data management, ensuring that everything is structured in a predictable, logical way and saving you time and headaches.
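To make this concrete, here's a minimal sketch of what an ECS-aligned `services` table could look like, using an in-memory SQLite database for illustration. The column names below mirror ECS `service.*` field conventions (`service.id`, `service.name`, `service.state`, `service.type`), but the actual column set is defined in the schema proposal, so treat this as an assumption, not the final design.

```python
import sqlite3

# Illustrative ECS-aligned services table; the real schema is defined
# in the proposal thread, this is only a sketch of the idea.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE services (
        id    TEXT PRIMARY KEY,  -- maps to ECS service.id
        name  TEXT NOT NULL,     -- maps to ECS service.name
        state TEXT,              -- maps to ECS service.state (running, stopped, ...)
        type  TEXT               -- maps to ECS service.type (systemd, launchd, ...)
    )
""")
conn.execute(
    "INSERT INTO services (id, name, state, type) VALUES (?, ?, ?, ?)",
    ("sshd.service", "sshd", "running", "systemd"),
)
row = conn.execute("SELECT name, state FROM services").fetchone()
print(row)  # ('sshd', 'running')
```

Because every platform's services land in the same columns, a single query shape works everywhere, which is exactly the payoff of standardizing on ECS.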
Inventory information must be retrievable via Wazuh's API and CLI
Next, the inventory information must be retrievable via Wazuh's API and CLI, meaning you can access the service data programmatically or through command-line tools. This flexibility is key for automation and integration with other systems: programmatic control for automated tasks, and command-line convenience for quick checks and manual operations. Whether you're building a custom monitoring solution or just need to grab some quick stats, both access points keep service data at your fingertips, no matter how you prefer to work.
Global queries must support filtering and aggregation on this new data
Global queries must support filtering and aggregation on this new data, which is essential for slicing and dicing it into meaningful insights. You'll be able to filter services by status, name, or any other relevant field, while aggregation lets you count and group services. Imagine being able to quickly see how many services are running on a particular host. With robust filtering and aggregation, you can zoom in on specific issues or zoom out for a high-level overview, turning a mountain of raw data into actionable intelligence in just a few clicks.
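As a sketch of the kind of filtering and aggregation this enables, assuming service records have already been normalized into a common shape (the `state` field name below is illustrative, standing in for the ECS-aligned column):

```python
from collections import Counter

# Hypothetical normalized service records; field names are illustrative.
services = [
    {"name": "sshd", "state": "running"},
    {"name": "cron", "state": "running"},
    {"name": "bluetooth", "state": "stopped"},
]

# Filter: keep only running services.
running = [s for s in services if s["state"] == "running"]

# Aggregate: count services per state.
by_state = Counter(s["state"] for s in services)
print(len(running), dict(by_state))  # 2 {'running': 2, 'stopped': 1}
```

The same filter-then-aggregate pattern is what a global query would perform server-side across all agents, rather than in client code.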
The Inventory Harvester must send this data to the indexer
Finally, the Inventory Harvester must send this data to the indexer, making the service data available for searching and analysis in Wazuh. The indexer is the heart of Wazuh's data processing pipeline, so getting the data there is crucial. Think of the Inventory Harvester as the data courier, reliably delivering the collected and normalized records for processing. Once indexed, the data becomes searchable and analyzable, unlocking insights into your system services whenever you need them. This seamless flow from collection to indexing is what makes Wazuh such a powerful tool for security monitoring and incident response.
Table Structure
The unified table consolidates data from various platform-specific sources: think `systemctl` on Linux, `launchctl` on macOS, and `sc query` on Windows. It aligns this data to the Elastic Common Schema (ECS) where applicable, which is key for consistency and interoperability. ECS is like a universal language for security data, ensuring different systems can understand and work with the same data in a consistent way. By mapping our service data to ECS fields, we make it easier to integrate with other tools and platforms that also use ECS, so you can correlate service data with other types of security events and get a more comprehensive view of your environment. The goal is a table structure that is robust and efficient, and that speaks the same language as all your favorite security tools.
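To illustrate the consolidation idea, here's a hedged sketch of normalizing records from those different platform sources into one shape. The raw record shapes and the mapping rules below are assumptions for illustration only; the real parsing lives in the Wazuh agent and harvester code.

```python
# Illustrative raw records, roughly as the platform tools might report them.
RAW = [
    {"platform": "linux",   "unit": "sshd.service",    "active": "active"},
    {"platform": "macos",   "label": "com.openssh.sshd", "pid": 120},
    {"platform": "windows", "service_name": "WinDefend", "state": "RUNNING"},
]

def normalize(raw: dict) -> dict:
    """Map a platform-specific record onto a common, ECS-style shape."""
    if raw["platform"] == "linux":
        state = "running" if raw["active"] == "active" else "stopped"
        return {"name": raw["unit"], "state": state}
    if raw["platform"] == "macos":
        # launchd jobs with a PID are loaded and running.
        return {"name": raw["label"], "state": "running" if raw.get("pid") else "stopped"}
    if raw["platform"] == "windows":
        return {"name": raw["service_name"], "state": raw["state"].lower()}
    raise ValueError(f"unknown platform: {raw['platform']}")

normalized = [normalize(r) for r in RAW]
print(normalized[0])  # {'name': 'sshd.service', 'state': 'running'}
```

After this step, every record has the same keys regardless of origin, which is what makes a single unified table possible.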
The proposed schema for this table is detailed in this comment:
Be sure to check out the link for the nitty-gritty details of the schema. That comment thread is where the specific fields, data types, and relationships are defined; it's the blueprint for the data table, like a detailed architectural plan before you start building a house. By reviewing the proposed schema, you'll get a clear picture of how the pieces of service information fit together, making it easier to work with the data down the line. So don't skip this step: it's your insider's guide to the structure of the `services` table!
Tasks
Alright, let's get down to the tasks at hand. We've got two main areas to focus on: Wazuh DB and Inventory Harvester. Each has its own set of challenges and goals.
Wazuh DB
Implement the `services` table schema in Wazuh DB
First up, we need to implement the `services` table schema in Wazuh DB: creating the actual database table with all the fields defined in the schema we talked about earlier, including data types, constraints, and relationships. Think of it as laying the foundation for our service data. A well-designed table schema is crucial for efficient storage and retrieval, so we need to get this right. This task is all about building a solid, scalable foundation for managing service information within Wazuh, ready to handle the influx of service data.
Make the data queryable from the Wazuh API and CLI inventory
Next, we need to make the data queryable from the Wazuh API and CLI inventory, hooking the database up to the outside world so users and systems can interact with the service data. This involves writing the code that exposes the data through the API and CLI: defining the endpoints, handling requests, and formatting the responses. The goal is a seamless, intuitive experience, so whether you're building a custom dashboard or running a quick command, you can get the service data you need, when you need it.
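As a rough sketch of what the query layer does underneath, assuming an SQLite-backed table (Wazuh DB stores its data in SQLite) with illustrative columns, the API and CLI essentially translate user filters into parameterized queries like this:

```python
import sqlite3

# Illustrative table; the real schema and query code live in Wazuh DB.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE services (name TEXT, state TEXT)")
conn.executemany(
    "INSERT INTO services VALUES (?, ?)",
    [("sshd", "running"), ("cron", "running"), ("cups", "stopped")],
)

def query_services(conn, state=None):
    """Mimic an API/CLI filter: optionally restrict results by service state."""
    sql, params = "SELECT name, state FROM services", ()
    if state is not None:
        sql += " WHERE state = ?"
        params = (state,)
    return conn.execute(sql, params).fetchall()

print(query_services(conn, state="running"))
```

Parameterized queries keep the filter values out of the SQL string itself, which is the standard way to expose user-supplied filters safely.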
Ensure ECS field naming and relationships are respected
It's crucial to ensure that ECS field naming and relationships are respected, which guarantees consistency and compatibility with other systems that use ECS. Remember, ECS is our universal language for security data, so we need to speak it fluently. This task involves carefully mapping the fields in our `services` table to their corresponding ECS fields and making sure the relationships between fields are correctly represented in the database. This meticulous attention to detail keeps our service data well-organized and ready to play nicely with the rest of the security ecosystem.
Write Unit Tests for the new database logic
Of course, we need to write unit tests for the new database logic, ensuring the code works as expected and catching bugs early. Unit tests let us isolate and verify individual components, so errors are caught before they reach the production system. This task is all about quality assurance: making sure our database logic is solid and dependable.
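As a hedged sketch of the style of test involved, here's a tiny example. The `normalize_state` helper is hypothetical, standing in for real Wazuh DB code (which is C++ and uses its own test framework); the point is simply that each small piece of mapping logic gets its own focused assertions, including the failure path.

```python
import unittest

def normalize_state(raw: str) -> str:
    """Hypothetical helper: collapse platform-specific states into common values."""
    mapping = {
        "active": "running", "RUNNING": "running",
        "inactive": "stopped", "STOPPED": "stopped",
    }
    return mapping.get(raw, "unknown")

class NormalizeStateTest(unittest.TestCase):
    def test_known_states(self):
        self.assertEqual(normalize_state("active"), "running")
        self.assertEqual(normalize_state("STOPPED"), "stopped")

    def test_unknown_state_is_flagged(self):
        # Unexpected inputs must degrade gracefully, not crash.
        self.assertEqual(normalize_state("weird"), "unknown")

if __name__ == "__main__":
    unittest.main()
```

Testing the unhappy path (an unrecognized state) is just as important as the happy path, since endpoints will inevitably report values we didn't anticipate.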
Inventory Harvester
Add support to InventoryHarvester to extract and send `services` data
Moving on to the Inventory Harvester, we need to add support to extract and send `services` data. This involves modifying the Harvester to collect service information from endpoints and format it according to our unified schema: querying the system for services, parsing the results, and mapping them to the ECS-aligned fields. We also need to make sure the Harvester handles the volume of service data efficiently, without impacting performance. The Harvester is our data collection agent, so this task expands its capabilities to give us a comprehensive view of the services running on our endpoints.
Use the field mapping defined in wazuh-indexer#1055
We'll be using the field mapping defined in wazuh-indexer#1055, which ensures the data is indexed correctly in Wazuh. This mapping is our Rosetta Stone, translating the service data into a format the Wazuh indexer understands. Adhering to it makes the data searchable and analyzable, which is crucial for integrity and usability. It's like a well-defined dictionary that keeps everyone on the same page when it comes to data interpretation, ensuring our service data is fully integrated into the Wazuh ecosystem.
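As a sketch, the mapping step might translate an internal row into an ECS-shaped document before handing it to the indexer. The authoritative field set comes from wazuh-indexer#1055; the keys below are illustrative ECS-style `service.*` fields, not the real mapping.

```python
def to_ecs_document(row: dict, agent_id: str) -> dict:
    """Build an ECS-shaped document from an internal row. Illustrative only:
    the real field mapping is defined in wazuh-indexer#1055."""
    return {
        "agent": {"id": agent_id},
        "service": {
            "id": row["id"],
            "name": row["name"],
            "state": row["state"],
            "type": row["type"],
        },
    }

doc = to_ecs_document(
    {"id": "sshd.service", "name": "sshd", "state": "running", "type": "systemd"},
    agent_id="001",
)
print(doc["service"]["name"], doc["agent"]["id"])  # sshd 001
```

Nesting the fields under `service.*` and `agent.*` is what lets these documents sit alongside other ECS events in the index and be correlated by the same queries.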
Develop and validate Unit Tests
Finally, we need to develop and validate unit tests for the Harvester. Just like with Wazuh DB, testing is crucial for reliability: these tests verify that the Harvester is correctly collecting, formatting, and sending service data, and catch bugs before they reach the production system. This task is all about building a robust, dependable data collection mechanism.
Conclusion
So there you have it, guys! Integrating the `services` table into Wazuh DB and Inventory Harvester is a big step towards unified, accessible service data. By following these tasks and guidelines, we'll build a robust system for managing service information across all platforms, giving you a clearer view of your environment and empowering you to make informed decisions. Stay tuned for more updates as we move forward with this exciting integration!