CLI + Bring Your Own Custom Model #8

rh-rahulshetty · 2025-12-02T12:03:40Z

This PR introduces the following major changes:

1. LogAn CLI

Provides a CLI for interacting with the LogAn library. Resolves #2.

Usage

uv run logan analyze -f ./examples/Linux_2k.log -o ./tmp/debug --clean-up

uv run logan view -d ./tmp/debug

2. LogAn Container

Updates the Containerfile to use the new LogAn CLI.

Usage

# Build container image from root
podman build -t logan -f Containerfile .

# Start analysis
podman run --rm \
    -v ./examples/:/data/input/:z \
    -v ./tmp/output/:/data/output/:z \
    -e LOGAN_INPUT_FILES="/data/input/Linux_2k.log" \
    -e LOGAN_OUTPUT_DIR=/data/output/ \
    logan

The LogAn container exposes several LOGAN_ environment variables that control the model, the model type, and other settings. See the README changes for details.
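
To illustrate how the LOGAN_ variables could relate to the CLI flags used in this PR, here is a minimal sketch. It is not the container's actual entrypoint: the helper name `build_logan_command` is hypothetical, and passing `LOGAN_INPUT_FILES` straight through to `-f` as a single value is an assumption. Only the variable names and flags that appear in this description are used.

```python
import shlex


def build_logan_command(env: dict[str, str]) -> list[str]:
    """Translate LOGAN_* variables into a `logan analyze` argument list.

    Hypothetical helper for illustration; the real entrypoint may differ.
    """
    input_files = env.get("LOGAN_INPUT_FILES")
    output_dir = env.get("LOGAN_OUTPUT_DIR")
    if not input_files or not output_dir:
        raise ValueError("LOGAN_INPUT_FILES and LOGAN_OUTPUT_DIR must be set")

    # Assumption: the value of LOGAN_INPUT_FILES is handed to -f unchanged.
    cmd = ["logan", "analyze", "-f", input_files, "-o", output_dir]

    # Optional settings are appended only when the variable is present.
    model_type = env.get("LOGAN_MODEL_TYPE")
    if model_type:
        cmd += ["--model-type", model_type]
    model = env.get("LOGAN_MODEL")
    if model:
        cmd += ["--model", model]
    return cmd


# Example with the values from the custom-model container run in this PR.
example_env = {
    "LOGAN_INPUT_FILES": "/data/input/Linux_2k.log",
    "LOGAN_OUTPUT_DIR": "/data/output/",
    "LOGAN_MODEL_TYPE": "custom",
    "LOGAN_MODEL": "/data/extra/custom_model.py:GLiNERModel",
}
print(shlex.join(build_logan_command(example_env)))
```

Keeping the mapping in one function makes it easy to see which variables are required and which are optional.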

3. Bring Your Own Custom Model

Provides a framework that lets users bring their own model script. The script subclasses ModelTemplate and implements golden signal classification and fault categorization with custom logic. It must live in a Python script or module, which LogAn loads dynamically at runtime.

Example

model.py

from gliner2 import GLiNER2
from logan.log_diagnosis.models.manager import ModelTemplate  # Base class for all custom models


class GLiNERModel(ModelTemplate):
    """
    A custom model implementation using GLiNER2 for log classification.
    
    GLiNER2 is a zero-shot named entity recognition and classification model
    that can be used for text classification tasks without task-specific training.
    
    This implementation demonstrates:
    - Single-label classification for golden signals
    - Multi-label classification for fault categories
    - Batch processing for efficient inference
    
    Attributes:
        extractor (GLiNER2): The GLiNER2 model instance for classification.
    """
    
    def init_model(self):
        """
        Initialize the GLiNER2 model.
        
        This method is called once after the model class is instantiated.
        Use this to load model weights, initialize tokenizers, or set up
        any resources needed for inference.
        
        Note:
            - This is separate from __init__ to allow the framework to control
              when expensive model loading happens
        """
        # Load the pre-trained GLiNER2 model from HuggingFace
        self.extractor = GLiNER2.from_pretrained("fastino/gliner2-base-v1")

    def classify_golden_signal(self, input: list[str], batch_size: int = 32) -> list[dict]:
        """
        Classify log lines into golden signal categories.
        
        Golden signals are the four key metrics from SRE practices:
        latency, traffic, errors, and saturation. This implementation
        extends it with 'information' and 'availability' for comprehensive
        log classification. You can customize the schema to include more categories.
        
        Args:
            input (list[str]): List of log text strings to classify.
                Each string is a single log line or log message.
            batch_size (int, optional): Number of samples to process in each batch.
                Larger batches are faster but use more memory. Defaults to 32.
        
        Returns:
            list[dict]: A list of dictionaries, one per input log line.
                Each dictionary contains:
                - 'labels' (list[str]): Single-element list with the predicted category
                - 'scores' (list[float]): Single-element list with confidence score (0-1)
        """
        # Define the classification schema with all golden signal categories
        # Each category has a description to help the model understand the intent
        schema = self.extractor.create_schema().classification(
            "golden_signal",
            {
                "information": "Classifies log lines that provide general operational details, status updates, or other non-critical messages.",
                "error": "Classifies log lines that indicate an error condition, malfunction, or abnormal system event.",
                "availability": "Classifies log lines related to system uptime, downtime, service availability, or accessibility.",
                "latency": "Classifies log lines that refer to delays, response times, or performance lag within the system.",
                "saturation": "Classifies log lines that signify resource exhaustion, capacity issues, or overloaded subsystems.",
                "traffic": "Classifies log lines describing system throughput, number of requests, connections, or data flow.",
            }
        )

        # Perform batch inference on all input logs
        results = self.extractor.batch_extract(
            input,
            schema,
            batch_size=batch_size,
            threshold=0.5,
            include_confidence=True,
        )

        # Transform results to the expected output format
        # The framework expects a list of dicts with 'labels' and 'scores' keys
        formatted_result = []
        for result in results:
            # Extract the golden_signal classification from results
            classification_result = result['golden_signal']
            
            # Format as expected: single label and score in lists
            # This is single-label classification (one label per log)
            formatted_result.append({
                'labels': [classification_result['label']],
                'scores': [classification_result['confidence']]
            })
        
        return formatted_result

    def classify_fault_category(self, input: list[str], batch_size: int = 32) -> list[dict]:
        """
        Classify log lines into fault categories (multi-label classification).
        
        Fault categories identify the domain or type of issue a log relates to.
        Unlike golden signals, a single log can belong to multiple fault categories
        (e.g., a log about network authentication failure could be both 'network' 
        and 'authentication').
        
        Args:
            input (list[str]): List of log text strings to classify.
                Each string is a single log line or log message.
            batch_size (int, optional): Number of samples to process in each batch.
                Larger batches are faster but use more memory. Defaults to 32.
        
        Returns:
            list[dict]: A list of dictionaries, one per input log line.
                Each dictionary contains:
                - 'labels' (list[str]): List of predicted category labels (can be multiple)
                - 'scores' (list[float]): Corresponding confidence scores for each label
        """
        # Define multi-label classification schema for fault categories
        # multi_label=True allows multiple categories per log
        schema = self.extractor.create_schema().classification(
            "fault_category",
            {
                "io": "Classifies log lines that refer to input/output operations, file system interactions, or data transfer.",
                "authentication": "Classifies log lines that relate to user authentication, authorization, or security protocols.",
                "network": "Classifies log lines that describe network connectivity, routing, or protocol interactions.",
                "application": "Classifies log lines that refer to application-level events, process interactions, or service calls.",
                "device": "Classifies log lines that relate to hardware device status, driver activity, or peripheral operations.",
            },
            multi_label=True,
            cls_threshold=0.3,
        )

        # Perform batch inference with multi-label classification
        results = self.extractor.batch_extract(
            input,
            schema,
            batch_size=batch_size,
            threshold=0.3,
            include_confidence=True,
        )

        # Transform results to the expected output format
        # For multi-label, each log may have multiple labels and scores
        formatted_result = []
        for result in results:
            # Extract fault_category results from relation_extraction output
            # GLiNER2 returns multi-label results differently than single-label
            classification_results = result['relation_extraction']['fault_category']
            
            # Build lists of labels and their corresponding scores
            label_score = {
                'labels': [],
                'scores': []
            }
            
            # classification_results is a list of (label, score) tuples
            for label, score in classification_results:
                label_score['labels'].append(label)
                label_score['scores'].append(score)
            
            formatted_result.append(label_score)
        
        return formatted_result
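
Both methods above return a list of dicts with 'labels' and 'scores' keys, one dict per input log line. When writing a custom model, a small check like the one below can catch shape mistakes before running a full analysis. This is a sketch only: `validate_predictions` is a hypothetical helper and is not part of LogAn.

```python
from numbers import Real


def validate_predictions(predictions: list[dict], n_inputs: int,
                         single_label: bool = False) -> None:
    """Raise ValueError if predictions break the documented output format.

    Hypothetical helper, not part of the LogAn API.
    """
    if len(predictions) != n_inputs:
        raise ValueError(f"expected {n_inputs} results, got {len(predictions)}")

    for i, item in enumerate(predictions):
        labels = item.get("labels")
        scores = item.get("scores")
        if not isinstance(labels, list) or not isinstance(scores, list):
            raise ValueError(f"result {i}: 'labels' and 'scores' must be lists")
        if len(labels) != len(scores):
            raise ValueError(f"result {i}: labels and scores differ in length")
        if single_label and len(labels) != 1:
            raise ValueError(f"result {i}: expected exactly one label")
        if not all(isinstance(label, str) for label in labels):
            raise ValueError(f"result {i}: labels must be strings")
        if not all(isinstance(score, Real) for score in scores):
            raise ValueError(f"result {i}: scores must be numbers")


# Golden-signal style output: exactly one label per log line.
validate_predictions([{"labels": ["error"], "scores": [0.91]}], 1, single_label=True)

# Fault-category style output: zero or more labels per log line.
validate_predictions(
    [{"labels": ["network", "authentication"], "scores": [0.7, 0.4]},
     {"labels": [], "scores": []}],
    2,
)
```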

Run Command

# Using CLI
uv run logan analyze \
        -f "examples/Linux_2k.log" \
        -o "tmp/debug" \
        --model-type custom \
        --model "examples/tutorials/custom_model.py:GLiNERModel" \
        --clean-up

# Using Container
podman run --rm \
        -v ./examples/tutorials:/data/extra/:z \
        -v ./examples/:/data/input/:z \
        -v ./tmp/output/:/data/output/:z \
        -e LOGAN_INPUT_FILES="/data/input/Linux_2k.log" \
        -e LOGAN_OUTPUT_DIR=/data/output/ \
        -e LOGAN_MODEL_TYPE=custom \
        -e LOGAN_MODEL="/data/extra/custom_model.py:GLiNERModel" \
        logan
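
The `--model` and `LOGAN_MODEL` values above use a `path/to/file.py:ClassName` format. As a rough illustration of how such a value could be resolved at runtime, the sketch below uses the standard library's `importlib`. The function name `load_class` is hypothetical, and this is not necessarily how LogAn implements the lookup.

```python
import importlib.util
from pathlib import Path


def load_class(spec: str) -> type:
    """Load a class from a 'path/to/file.py:ClassName' string.

    Hypothetical helper; LogAn's own loader may work differently.
    """
    # Split on the last colon so paths that contain a colon still work.
    path_str, sep, class_name = spec.rpartition(":")
    if not sep or not path_str or not class_name:
        raise ValueError(f"expected 'file.py:ClassName', got {spec!r}")

    path = Path(path_str)
    if not path.is_file():
        raise FileNotFoundError(path)

    # Build a module object from the file and execute it.
    module_spec = importlib.util.spec_from_file_location(path.stem, path)
    if module_spec is None or module_spec.loader is None:
        raise ImportError(f"cannot import {path}")
    module = importlib.util.module_from_spec(module_spec)
    module_spec.loader.exec_module(module)

    try:
        return getattr(module, class_name)
    except AttributeError as exc:
        raise ImportError(f"{path} does not define {class_name}") from exc
```

A caller would then instantiate the returned class and, following the docstring in the example model, call `init_model()` once before classification.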

4. GitHub CI Workflow

The workflow is triggered automatically whenever a new release is published. It builds the container image and publishes it to the GitHub Container Registry (ghcr.io).

5. Other Updates

Removed the drain3 code from the repository; it is now installed as a package from PyPI.
Restructured the code to move all core logic inside the logan module.
Added setup.py for one-command installation with pip or uv.

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

rh-rahulshetty · 2025-12-18T07:50:33Z

Built the following image with the GitHub Actions workflow on my profile.

Example Usage:

podman run --rm \
    -v ./examples/:/data/input/:z \
    -v ./tmp/output/:/data/output/:z \
    -e LOGAN_INPUT_FILES="/data/input/Linux_2k.log" \
    -e LOGAN_OUTPUT_DIR=/data/output/ \
    ghcr.io/rh-rahulshetty/logan:0.0.1

More info: https://github.com/rh-rahulshetty/LogAn/pkgs/container/logan/615819004?tag=latest

rh-rahulshetty added 2 commits November 28, 2025 17:53

catch error when time extraction fails and log the line

d8ad4a7

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

custom model initial commit

4703bd3

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

rh-rahulshetty requested a review from Pranjal-Gupta2 December 9, 2025 06:06

rh-rahulshetty added 3 commits December 10, 2025 18:11

custom model example

e288fb4

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

add tutorial example for custom model

f69dc91

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

logan code restructure + cli

cec5eff

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

rh-rahulshetty changed the title ~~Bring Your Own Custom Model~~ CLI + Bring Your Own Custom Model Dec 15, 2025

rh-rahulshetty added 3 commits December 15, 2025 16:24

fix anomaly report issue

f57bcd6

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

update new container usage

1ddd62c

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

update custom model usage with new cli and podman

63cf84a

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

rh-rahulshetty marked this pull request as ready for review December 15, 2025 12:34

rh-rahulshetty added 5 commits December 17, 2025 12:55

add ghcr build workflow for release

c6cc400

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

test workflow

8fa401e

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

revert

288e30e

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

fix container issue

c65b3d7

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

switch to RH python3 base image

0e78754

Signed-off-by: Rahul Shetty <rashetty@redhat.com>

rh-rahulshetty added the enhancement New feature or request label Dec 18, 2025
