ML Engine#
When deployed as part of a Plixer One Enterprise environment, the Plixer ML Engine applies anomaly and threat detection techniques to the network data collected by Scrutinizer.
Note
To learn more about Plixer One Enterprise licensing options, contact Plixer Technical Support.
This configuration guide introduces the capabilities of the ML Engine and explains how to manage the settings that control its functions and behavior.
Overview#
Once deployed and configured, the engine is able to ingest flow data through Scrutinizer and apply multiple machine learning techniques to identify potentially problematic activity on the network.
The Plixer ML Engine has several key functions that enable intelligent, multi-layered anomaly and threat detection in a Plixer One Enterprise deployment:
Comprehensive network behavior modeling: Leveraging the large volumes of flow data collected by Scrutinizer, the engine is capable of building behavioral models encompassing network activity at any scale. It can then learn to recognize deviations and suspicious activity, such as data accumulation/exfiltration, tunneling, and lateral movement, that may indicate an attack on the network.
Accessible behavioral insights for network assets: After being alerted to anomalous behavior, network and security teams can drill down into the associated hosts, IP address groups, and/or exporter interfaces to better understand the details of their involvement in the reported detection.
Highly configurable ML modeling: The ML Engine monitors network activity based on user-customizable dimensions and inclusion/exclusion rules. Consistently repeated traffic patterns, asset/group importance, and data seasonality are all taken into consideration as well, resulting in models that are uniquely tailored to each environment.
ML-based malware detection: Using pre-trained classification models, the engine is able to recognize generic activity patterns that are associated with common classes of malware, including command and control, remote access trojans, and exploit kits. This adds another layer of protection to further reduce risk and mean time to resolution (MTTR) when threats are detected.
Continuous observation and learning: As it ingests additional flow data, the ML Engine updates its behavior models based on a schedule that defines weekdays, weeknights, and weekends to account for changes in legitimate activity patterns and improve recognition of advanced threats that attempt to disguise their behavior.
Managing inclusion and exclusion rules#
To ensure that its behavior models represent only relevant network activity, the Plixer ML Engine can be uniquely tailored to its environment using custom rules defining inclusions and exclusions for its functions. These rules can be managed from the Admin > Alarm Monitor > ML Rules view of the Scrutinizer web interface.
Inclusion rules#
An inclusion rule defines either a network address (hosts/subnets) or exporter interface as a network data source for the ML Engine. Each rule also includes a sensitivity setting (see below) that is applied to the asset specified.
Malware detection, which uses pre-trained classification models to recognize generic malware behaviors, can also be enabled for individual inclusions.
Inclusion sensitivity#
An inclusion’s sensitivity setting can be used to tune the engine’s tolerance for behavioral deviations for the host/subnet or exporter interface. Lowering the sensitivity setting for an asset will cause even minor deviations to be reported as detections, resulting in a higher volume of alarms. Conversely, increasing the sensitivity will allow for greater deviation, which translates to fewer detections reported.
When defining inclusions, the sensitivity setting should be left at its default value. After a period of 7 days (recommended), if too many unwarranted detection alarms are triggered, the sensitivity can be increased to the next level.
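The relationship between sensitivity and detection volume described above can be sketched as follows. This is an illustrative model only, not product code; the sensitivity levels and tolerance values are assumptions chosen to show the direction of the relationship (higher sensitivity tolerates larger deviations, producing fewer detections).

```python
# Hypothetical mapping from sensitivity level to the maximum tolerated
# deviation from an asset's modeled baseline (fraction of baseline).
# These values are illustrative assumptions, not the engine's actual ones.
TOLERANCE_BY_SENSITIVITY = {1: 0.10, 2: 0.25, 3: 0.50}

def is_detection(observed: float, baseline: float, sensitivity: int) -> bool:
    """Report a detection when the observed value deviates from the
    baseline by more than the tolerance for the given sensitivity level."""
    tolerance = TOLERANCE_BY_SENSITIVITY[sensitivity]
    deviation = abs(observed - baseline) / baseline
    return deviation > tolerance

# At the lowest sensitivity, a 20% deviation is reported as a detection...
print(is_detection(observed=120, baseline=100, sensitivity=1))  # True
# ...while a higher sensitivity tolerates the same deviation.
print(is_detection(observed=120, baseline=100, sensitivity=3))  # False
```

This is why raising the sensitivity one level at a time, after observing the alarm volume for about a week, is the recommended tuning approach.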
Exclusion rules#
Exclusion rules can be used to ignore one or more ML-driven detections for traffic originating from a specified source and/or bound for a specified destination.
If expected traffic or activity triggers alarms, one or more exclusion rules should be created to exempt the source and/or destination addresses from the detections being reported.
Recommendations#
Inclusion/exclusion rule recommendations
As part of the ML Engine’s initialization, inclusion rules are automatically created for the twenty most suitable network assets (hosts and exporters/interfaces) based on its default dimension definitions. If necessary, additional rules should be created to cover all assets associated with critical/sensitive network activity (“crown jewel” assets) and hard-to-monitor traffic (e.g., IoT devices, operational technology, etc.).
The following resources are examples of network assets that are highly recommended for inclusion:
AD servers
DB servers
DNS servers
DHCP servers
Web servers
Source code repositories
Object repositories
FTP servers
If there are assets whose typical behavior is being reported as anomalous/suspicious, exclusion rules should be defined to exempt the traffic from superfluous detections.
Managing dimensions#
The Plixer ML Engine’s feature dimension list defines the protocols and ports to be observed on the network assets defined by its inclusion/exclusion rules. The engine uses these dimensions to build the behavior models that drive asset behavior insights and deliver anomaly and threat alerts via the Scrutinizer alarm monitor.
The default configuration for the ML Engine includes recommended dimension definitions, which are used to automatically select suitable data sources as inclusions. After the engine is deployed and set up, dimensions can be managed from the Admin > Alarm Monitor > ML Dimensions view of the Scrutinizer web interface.
Dimension configuration#
An ML dimension is defined by the following parameters:
Inclusion/asset type the dimension applies to (host/subnet or exporter interface)
Template field to use for grouping (sourceipaddress or destinationipaddress; host/subnet dimensions only)
Aggregation method to use (octetdeltacount or packetdeltacount)
Traffic port used
Note
A feature dimension is only observed for traffic associated with the type of inclusion (host/subnet or exporter interface) it was defined for.
Dimensions can be configured to apply to all or only internal traffic matching the definition. They can also be disabled and re-enabled as necessary.
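The parameters above can be pictured as a small record per dimension. The sketch below is an illustrative data model only, not the engine's actual schema; the field names and validation are assumptions based on the parameter list in this section.

```python
# Illustrative data model for an ML dimension (assumed field names, not the
# engine's actual schema), covering the parameters described above.
from dataclasses import dataclass
from typing import Optional

@dataclass
class MLDimension:
    asset_type: str    # "host_subnet" or "exporter_interface"
    aggregation: str   # "octetdeltacount" or "packetdeltacount"
    port: int          # traffic port observed
    grouping_field: Optional[str] = None  # "sourceipaddress" or
                                          # "destinationipaddress"
    internal_only: bool = False           # apply to internal traffic only
    enabled: bool = True                  # dimensions can be disabled/re-enabled

    def __post_init__(self):
        if self.aggregation not in ("octetdeltacount", "packetdeltacount"):
            raise ValueError("unsupported aggregation method")
        # Grouping fields apply to host/subnet dimensions only.
        if self.grouping_field and self.asset_type != "host_subnet":
            raise ValueError("grouping field is host/subnet only")

# Example: a host/subnet dimension watching DNS traffic by source address.
dns_dim = MLDimension("host_subnet", "octetdeltacount", 53, "sourceipaddress")
```

The validation mirrors the note above: a grouping field only makes sense for host/subnet dimensions, since exporter-interface dimensions are not grouped by address.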
Recommendations#
Dimension recommendations
Once deployed, the ML Engine defaults to Plixer’s recommended dimension definitions, which are based on the traffic in typical enterprise environments.
These default definitions should be reviewed and, if necessary, additional dimensions should be defined to monitor critical network services that are most often the target of attacks, such as:
Authentication - Kerberos, NTLM
Domain services - LDAP, DNS, DHCP
File sharing services - SMB, NFS, CIFS
Remote connectivity - SSH, Telnet, RDP, VNC, FTP
Email protocols - SMTP, POP3
Inter-process communication - ICMP
Application protocols - HTTP, HTTPS
Others - DB services, third-party APIs (especially those that connect to the Internet)
Global ML settings#
The global ML settings under Admin > Settings can be used to configure parameters for certain ML functions and behaviors across all engines in an environment.
The default values for these settings are recommended for new ML Engine deployments but can be adjusted later as described in the sections below.
AD Users#
The Plixer ML Engine can also ingest user activity data and access logs, alerting users to anomalous behavior through user and entity behavior analytics (UEBA) detections.
UEBA alerts for Active Directory users can be enabled by adding the credentials for a Microsoft Azure account that is configured to store AD user sign-in logs under Admin > Settings > ML AD Users.
Alerts#
There are three categories of alert settings that can be adjusted under Admin > Settings > ML Alerts:
Microsoft Office 365 alerts
These sensitivity values adjust the magnitude of deviation from typical behavior that will trigger the corresponding alerts. A higher value allows for greater deviation, resulting in fewer alerts for the corresponding activity.
Logon Sensitivity: Unusual volumes of Office 365 login events
Unique Source Sensitivity: Traffic coming from unusual numbers of unique hosts
Unique Location Sensitivity: Traffic coming from unusual numbers of unique locations
Like inclusion sensitivities, these values should only be adjusted after assessing the accuracy of alarms/detections.
System vitals alerts
These thresholds control alerts and other actions related to high utilization of the ML Engine’s resources.
CPU/RAM/Disk Alert Threshold: Percentages at which a high utilization alert for the corresponding resource is triggered
Disk Reclaim Threshold: Disk utilization percentage at which the ML Engine will attempt to delete old indexes from Elasticsearch
Initially, these thresholds should be left at their default values. If alarms are triggered, run an ML Engine CPU, ML Engine Memory, and/or ML Engine Storage report to assess whether threshold(s) need to be increased (for temporary spikes) or additional resources should be allocated to the engine (for sustained high utilization).
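The threshold behavior described above can be summarized in a short sketch. This is assumed logic for illustration only, not product code, and the threshold values used are examples rather than the shipped defaults.

```python
# Illustrative sketch of the system vitals threshold logic described above.
# Threshold values are example assumptions, not the shipped defaults.
ALERT_THRESHOLDS = {"cpu": 90, "ram": 90, "disk": 85}  # percent utilization
DISK_RECLAIM_THRESHOLD = 80  # percent; triggers deletion of old ES indexes

def vitals_actions(cpu: float, ram: float, disk: float) -> list:
    """Return the actions implied by the current utilization percentages."""
    actions = []
    for name, pct in (("cpu", cpu), ("ram", ram), ("disk", disk)):
        if pct >= ALERT_THRESHOLDS[name]:
            actions.append(f"alert:{name}")
    if disk >= DISK_RECLAIM_THRESHOLD:
        actions.append("reclaim:delete-old-indexes")
    return actions

# CPU over its alert threshold; disk below its alert threshold but above
# the reclaim threshold, so old indexes would be cleaned up.
print(vitals_actions(cpu=95, ram=50, disk=82))
# ['alert:cpu', 'reclaim:delete-old-indexes']
```

Note that the reclaim threshold sits below the disk alert threshold, so index cleanup can occur before a disk alarm is ever raised.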
Kafka lag thresholds
These thresholds manage the amount of latency tolerated by the Kafka engine before the corresponding lag alert is triggered.
Kafka Netflow Lag Threshold: Alerts for flow ingestion latency
Kafka K-means Lag Threshold: Alerts for prediction latency
Kafka Alerts Lag Threshold: Alerts triggered by automated process reconnaissance
Kafka Training Data Lag Threshold: Alerts for behavior modeling latency
Kafka UEBA Lag Threshold: Alerts for user and entity behavior analytics (UEBA) data latency
If alarms are triggered, run an ML Engine Kafka Lag report to determine whether there is a need to scale up the engine’s resources.
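For background, a Kafka "lag" figure measures how far a consumer group trails the newest messages: for each partition, lag is the log end offset minus the group's committed offset. The sketch below shows that general Kafka arithmetic; it is not Plixer-specific code, and the threshold value is a hypothetical example.

```python
# General Kafka semantics (not Plixer-specific): per-partition lag is the
# log end offset minus the consumer group's committed offset, summed over
# all partitions of the topic.
def total_consumer_lag(end_offsets: dict, committed_offsets: dict) -> int:
    """Sum per-partition lag for a consumer group."""
    return sum(
        end_offsets[p] - committed_offsets.get(p, 0) for p in end_offsets
    )

# Two partitions, with the consumer trailing by 150 + 40 = 190 messages.
lag = total_consumer_lag({0: 1000, 1: 500}, {0: 850, 1: 460})
exceeds = lag > 100  # hypothetical lag threshold for illustration
print(lag, exceeds)  # 190 True
```

A persistently growing lag, rather than a momentary spike, is the usual sign that the engine's resources need to be scaled up.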
Data limits#
The ML Engine’s data limit settings manage the maximum numbers of behavior models and hosts used for network/user activity patterns and prediction. The initial values set are based on the engine’s default resource configuration, but they can be adjusted under Admin > Settings > ML Data Limits.
If there are alarms associated with these limits, the engine may need to be provisioned with additional resources to sustain the current volume of inclusions.
Note
To check the utilization for the current model limit, run an ML Engine Model Count report.
Training schedule#
The settings under Admin > Settings > ML Training Schedule determine the seasonality applied when the ML Engine ingests traffic data, allowing it to distinguish between network activity during and outside of an organization’s hours of operation.
The engine defaults to business hours of 8 am to 6 pm, from Monday to Friday. These settings can be changed after deployment if necessary.
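The weekday/weeknight/weekend seasonality mentioned earlier can be illustrated with a simple bucketing function. This is assumed logic for illustration, not the engine's implementation, using the default business hours of 8 am to 6 pm, Monday through Friday.

```python
# Illustrative sketch (assumed logic, not product code): bucketing a
# timestamp into the weekday/weeknight/weekend seasonality classes using
# the default business hours of 8 am to 6 pm, Monday through Friday.
from datetime import datetime

def seasonality_bucket(ts: datetime,
                       start_hour: int = 8, end_hour: int = 18) -> str:
    if ts.weekday() >= 5:              # Saturday (5) or Sunday (6)
        return "weekend"
    if start_hour <= ts.hour < end_hour:
        return "weekday"               # business hours, Mon-Fri
    return "weeknight"                 # outside business hours, Mon-Fri

print(seasonality_bucket(datetime(2024, 6, 5, 10)))  # Wed 10 am -> weekday
print(seasonality_bucket(datetime(2024, 6, 5, 22)))  # Wed 10 pm -> weeknight
print(seasonality_bucket(datetime(2024, 6, 8, 10)))  # Saturday  -> weekend
```

Keeping these buckets separate lets the engine model, for example, a nightly backup job as normal weeknight traffic without loosening its weekday baselines.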
ML cluster settings#
The ML Engine is built on Kubernetes, which deploys scalable pods to handle various tasks. Most services within the engine consume data from Kafka, which acts as the system’s backbone for message passing, both from Scrutinizer and between internal components.
Kafka supports consumer groups, which allow multiple pods to share workloads efficiently, making it easy to scale services horizontally by increasing the number of replicas. This supports high-throughput processing and flexible resource allocation across services like data ingestion and model training.
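The reason adding replicas spreads the workload is Kafka's partition assignment: each partition of a topic is consumed by exactly one member of a consumer group. The sketch below models a simple round-robin assignment to show the effect; it illustrates general Kafka behavior, not Plixer's code, and real Kafka clients use their own configurable assignment strategies.

```python
# General Kafka behavior sketch (not Plixer-specific): partitions of a topic
# are divided among consumer-group members, so adding pod replicas (up to
# the partition count) spreads the ingestion workload. Round-robin shown
# here for illustration; real clients use configurable assignors.
def assign_partitions(partitions: list, members: list) -> dict:
    """Round-robin partition assignment across consumer-group members."""
    assignment = {m: [] for m in members}
    for i, p in enumerate(partitions):
        assignment[members[i % len(members)]].append(p)
    return assignment

# Six partitions shared by three ingestion replicas: two partitions each.
print(assign_partitions(list(range(6)), ["pod-0", "pod-1", "pod-2"]))
# {'pod-0': [0, 3], 'pod-1': [1, 4], 'pod-2': [2, 5]}
```

This also explains the scaling ceiling: once the replica count exceeds the partition count, additional pods sit idle.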
To ensure optimal performance based on the scale of the deployment and the volume of data processed, the engine management page can be used to register and manage ML Engine deployments and configure various settings for individual engines.
Engine settings are accessed via the configuration tray, which is divided into the sections below.
Settings#
The Settings secondary tray contains the following options for tuning resource allocations for specific engine tasks/services:
Ingestion Replica Count
This setting defines the number of data-ingestion pods running in the cluster. Adjusting this value helps scale the ingestion throughput and ensures SLAs are met.
These pods are responsible for consuming netflow data from Kafka topics, processing and aggregating flow records for both security (PSI) and network (PNI) monitoring, storing the processed data into Elasticsearch indices, and handling classification data for supervised ML models. The ingestion service runs continuously, processing data in one-minute intervals and logging heartbeats to ensure operational visibility.
Train Anomaly Detection Replica Count
This setting specifies how many pods are deployed to train ML models for the anomaly detection service.
This service uses techniques such as silhouette analysis and overfit deviation detection with configurable thresholds and limits. Once trained, the models are published to Kafka topics for use by downstream services. Scaling this service allows for faster and more resilient model training, particularly in environments with large or complex datasets.
Ingestion CPU & Memory (Min/Max)
These settings define resource limits allocated to the data ingestion service.
These resource limits ensure that each ingestion pod has the CPU and memory it needs to handle high-throughput data processing tasks. The ingestion service performs complex transformations, maintains multiple in-memory maps for real-time analytics, and conducts bulk insert operations into Elasticsearch. Typically, memory allocations in the range of 1 GB to 2 GB are required to support the various data structures used during processing.
Elasticsearch Memory and CPU (Min/Max)
These settings define resources allocated to the Elasticsearch cluster pods managed via the ECK (Elastic Cloud on Kubernetes) operator.
Since Elasticsearch is used to store and index all processed flow and classification data, and must support real-time search queries for machine learning and security operations, it is essential that sufficient resources are provided. The Java heap is configured with 8 GB (using -Xms8g -Xmx8g), and a total memory allocation of approximately 12 GB is recommended to provide additional headroom for OS and Elasticsearch operations. Similarly, minimum and maximum CPU allocations help maintain consistent indexing and query performance.
The Kibana UI can also be deployed alongside Elasticsearch by toggling on the Enable Kibana option.
Collectors#
Collectors selected here will be used as data sources for ingestion by the current engine.
DGL IP Groups#
IP groups added to the Deep Graph Learning inclusion list will be monitored by the engine to identify anomalous interactions between hosts.