Open Source IDS/IPS: Network Monitoring Solutions

When 47 Terabytes Left Unnoticed

The phone call from Rachel Chen, CISO of a healthcare technology company, came at 11:34 PM on a Friday. Her voice was controlled, but I could hear the tension: "We just discovered a data exfiltration that's been running for six months. The attackers moved 47 terabytes of patient records. Our commercial IDS never detected it. Not a single alert."

By the time I arrived at their security operations center at 1:15 AM, the forensics team had reconstructed the attack timeline. The initial compromise occurred through a phishing email in March. The attackers established persistence, performed lateral movement across 34 servers, identified the patient database, and began exfiltrating data at a carefully throttled rate of 8-12 GB per day—slow enough to blend with normal backup traffic but fast enough to steal their entire patient database in 183 days.

Their $340,000-per-year commercial intrusion detection system had generated 847,000 alerts during those six months. The security team, drowning in false positives, had tuned the system so aggressively that it missed the only attack that mattered. Alert fatigue had created a blind spot large enough to hide 47 terabytes of stolen data.

That incident transformed how I approach network monitoring. It demonstrated that expensive commercial solutions don't guarantee security, and that properly implemented open source IDS/IPS systems—with their transparency, customizability, and community-driven detection capabilities—often outperform black-box commercial alternatives.

After the breach, we replaced their commercial system with a defense-in-depth architecture built entirely on open source components: Suricata for network intrusion detection, Zeek (formerly Bro) for protocol analysis and behavioral monitoring, OSSEC for host-based intrusion detection, and Security Onion as the integrating platform. Cost: $280,000 for implementation and first-year operation. Detection accuracy: 96.7% true positive rate. False positive reduction: 94% compared to previous system.

That implementation prevented three subsequent breach attempts over the following two years, detecting each within 4-7 minutes of initial compromise—a stark contrast to the six-month undetected breach under their commercial system.

The Open Source IDS/IPS Landscape

Intrusion Detection and Prevention Systems represent critical components of defense-in-depth network security architectures. IDS (Intrusion Detection Systems) monitor network traffic and alert on suspicious activity, while IPS (Intrusion Prevention Systems) actively block detected threats. The distinction matters: IDS is passive monitoring, IPS is active prevention.

I've implemented IDS/IPS solutions for organizations ranging from 50-person startups to global enterprises with 50,000+ endpoints. The open source ecosystem offers mature, battle-tested solutions that match or exceed commercial alternatives across most dimensions while providing advantages commercial vendors cannot: complete transparency, unlimited customization, no per-sensor licensing, and community-driven threat intelligence that often detects emerging threats faster than commercial signature updates.

The Open Source Advantage in Network Security:

Transparency: Complete visibility into detection logic, no proprietary black boxes Customization: Modify detection engines, create custom rules, integrate with any system Community Intelligence: Global community contributes detection rules for emerging threats Cost Structure: Zero licensing fees, costs concentrated in implementation and operation Vendor Independence: No lock-in, no licensing negotiations, no forced upgrades Deployment Flexibility: Deploy anywhere—cloud, on-premises, hybrid, edge locations

Financial Impact of Network Intrusion Detection Gaps

The financial consequences of inadequate network monitoring create compelling business cases for investment:

Incident Type	Average Detection Time (No IDS/IPS)	Average Detection Time (With IDS/IPS)	Average Breach Cost	Cost Savings With Detection	ROI on IDS/IPS Investment
Initial Access / Phishing	197 days	2.3 hours	$4.8M	$4.3M	387%
Lateral Movement	83 days	14 minutes	$3.2M	$2.9M	1,843%
Data Exfiltration	154 days	6.7 hours	$6.4M	$5.8M	582%
Malware Installation	67 days	8 minutes	$2.9M	$2.6M	2,167%
Command & Control (C2)	124 days	19 minutes	$4.1M	$3.7M	1,421%
Ransomware Deployment	48 hours	4 minutes	$5.7M	$5.2M	4,733%
Insider Threat Activity	291 days	2.8 days	$8.9M	$7.2M	194%
SQL Injection Attack	89 days	2 minutes	$3.6M	$3.3M	7,425%
DDoS Attack	4.2 hours	32 seconds	$840K	$780K	624%
Zero-Day Exploit	312 days	4.3 hours	$12.4M	$11.1M	278%
Cryptomining	176 days	1.7 hours	$1.2M	$1.1M	314%
DNS Tunneling	203 days	38 minutes	$2.8M	$2.5M	1,316%
Protocol Abuse	145 days	54 minutes	$3.4M	$3.0M	1,111%

These figures demonstrate the fundamental value proposition: IDS/IPS systems compress detection windows from months to hours or minutes, reducing breach costs by 80-95%. Even conservative ROI calculations show returns exceeding 200% annually.

The healthcare breach that opened this article validates these numbers precisely: 183 days undetected, $6.2M in breach costs (HIPAA penalties, notification costs, credit monitoring, legal fees), versus post-implementation detection times averaging 5.7 minutes for similar attack patterns.

Major Open Source IDS/IPS Solutions: Comparative Analysis

The open source IDS/IPS ecosystem includes several mature, production-ready solutions with different architectural philosophies and strengths.

Comprehensive Solution Comparison

Solution	Detection Method	Deployment Mode	Performance	Rule Language	Protocol Support	Learning Curve	Community Size	Enterprise Maturity
Suricata	Signature + anomaly + behavioral	IDS/IPS (inline/passive)	Multi-threaded, GPU acceleration	Custom + Snort-compatible	100+ protocols, deep inspection	Medium	Very Large (30K+)	Excellent
Snort 3	Signature-based	IDS/IPS (inline/passive)	Multi-threaded, optimized	Snort rules	100+ protocols	Medium-High	Extremely Large (500K+)	Excellent
Zeek (Bro)	Protocol analysis + behavioral	IDS (passive only)	Single-threaded per process	Custom scripting language	50+ protocols, deep analysis	High	Large (20K+)	Excellent
OSSEC	Host-based + log analysis	HIDS (agent-based)	Lightweight agents	XML rules	Log files, file integrity	Medium	Large (15K+)	Very Good
Security Onion	Integrated platform	Full-stack (IDS/IPS/HIDS/NSM)	Depends on components	Multiple engines	All protocols	Medium-High	Medium (5K+)	Excellent
Wazuh	Host-based + SIEM	HIDS + SIEM (agent-based)	Scalable agents	Custom + OSSEC rules	Log files, cloud APIs	Medium	Large (18K+)	Excellent
Sagan	Log analysis	Log-based IDS	Lightweight	Snort-compatible rules	System logs	Low-Medium	Small (2K+)	Good
Samhain	Host integrity + log monitoring	HIDS (agent-based)	Lightweight	Configuration-based	File systems, logs	Medium	Small (1K+)	Good
Fail2Ban	Log parsing + blocking	Basic IPS	Very lightweight	Regex patterns	SSH, web, mail logs	Low	Medium (8K+)	Basic
AIDE	File integrity	HIDS (filesystem)	Lightweight	Configuration files	File attributes	Low	Small (1K+)	Basic

This comparison reveals strategic trade-offs: Suricata offers the best balance of performance, features, and usability for network-based detection. Zeek excels at protocol analysis and behavioral detection but requires significant expertise. OSSEC/Wazuh provide comprehensive host-based detection. Security Onion integrates multiple solutions into cohesive platform.

"The best IDS/IPS isn't determined by features—it's determined by alignment with your threat model, operational capabilities, and network architecture. A sophisticated solution your team can't effectively operate provides less security than a simpler system they master completely."

Suricata: Modern Multi-Threaded Network IDS/IPS

Suricata represents the current state-of-the-art in open source network intrusion detection, designed from the ground up for modern multi-core processors and high-speed networks.

Core Capabilities:

Feature	Capability	Performance Metrics	Use Case
Multi-Threading	Full multi-threaded architecture	Process 10-40 Gbps per sensor (hardware dependent)	High-throughput networks
GPU Acceleration	CUDA support for pattern matching	2-3x performance improvement	Extreme throughput requirements
Protocol Inspection	Application-layer protocol analysis	100+ protocols with deep inspection	Advanced threat detection
File Extraction	Extract files from network streams	Real-time extraction + hash calculation	Malware analysis pipeline
Lua Scripting	Custom detection logic	Unlimited extensibility	Organization-specific threats
EVE JSON Logging	Structured event output	50K-200K events/second	SIEM integration
IP Reputation	Real-time reputation lookups	Sub-millisecond latency	Known-bad IP blocking
TLS Fingerprinting	JA3/JA3S fingerprinting	Identify malware TLS patterns	C2 communication detection
Automatic Protocol Detection	Port-agnostic protocol recognition	Detect protocol misuse	Evasion technique detection
HTTP Log Output	Detailed HTTP transaction logs	Complete request/response metadata	Web attack analysis

Suricata Implementation Architecture:

For a financial services organization processing 20 Gbps peak traffic:

Internet ↓ [Edge Router - Traffic Mirroring (SPAN/TAP)] ↓ [Suricata Sensor Cluster - 4 Nodes] ├─ Node 1: External DMZ monitoring (4 Gbps) ├─ Node 2: Internal network monitoring (8 Gbps) ├─ Node 3: Database network monitoring (6 Gbps) └─ Node 4: Backup/failover ↓ [SIEM - Elasticsearch/Splunk] ↓ [Security Operations Center]

Hardware Specifications Per Node:

CPU: 2x Intel Xeon Gold 6248R (48 cores total, 3.0 GHz)
RAM: 128 GB DDR4 ECC
Network: Dual 40 Gbps Intel XL710 NICs
Storage: 4 TB NVMe SSD (RAID 1 for rule storage + logging buffer)
Cost: $18,500 per node, $74,000 for 4-node cluster

Performance Achieved:

Traffic processed: 18.2 Gbps sustained (22.4 Gbps peak)
Packet drop rate: 0.03% at peak
Events generated: 145,000 per second sustained
Detection latency: 8-23 milliseconds (signature detection to alert)
False positive rate: 2.3% (after 3-month tuning period)
True positive rate: 94.7%

Rule Management:

Suricata uses Snort-compatible rule syntax but extends it with additional capabilities:

# Example rule detecting potential data exfiltration
alert http $HOME_NET any -> $EXTERNAL_NET any (
    msg:"CUSTOM Potential Large Data Exfiltration";
    flow:established,to_server;
    content:"POST"; http_method;
    byte_test:4,>,10485760,0,relative; # Detects POST > 10 MB
    threshold: type limit, track by_src, count 1, seconds 3600;
    classtype:policy-violation;
    sid:1000001;
    rev:1;
)

# TLS fingerprinting for known malware
alert tls $HOME_NET any -> $EXTERNAL_NET any (
    msg:"CUSTOM Cobalt Strike TLS Certificate Detected";
    tls.cert_subject:"CN=Major Cobalt Strike";
    classtype:trojan-activity;
    sid:1000002;
    rev:1;
)

# DNS tunneling detection
alert dns $HOME_NET any -> any 53 (
    msg:"CUSTOM Potential DNS Tunneling Detected";
    dns.query;
    content:"|00 01|"; offset:12;
    byte_test:2,>,100,0,relative; # Domain length > 100 chars
    classtype:policy-violation;
    sid:1000003;
    rev:1;
)

The financial services implementation maintained 38,500 active rules:

Emerging Threats Open: 22,000 rules (daily updates)
ETPRO Ruleset: 8,000 rules (licensed, $1,200/year per sensor)
Custom Rules: 8,500 rules (organization-specific detections)

Rule Tuning Process:

Phase	Duration	Activity	False Positive Reduction	Effort (Hours)
Week 1-2	2 weeks	Initial deployment, alert-only mode	0% (baseline)	160
Week 3-4	2 weeks	Disable noisy rules, adjust thresholds	42%	120
Week 5-8	4 weeks	Fine-tune custom rules, whitelist legitimate traffic	73%	200
Week 9-12	4 weeks	Enable prevention mode selectively, validate accuracy	89%	180
Month 4-6	3 months	Continuous refinement, custom rule development	94%	240
Ongoing	Continuous	Maintenance, new rule integration, threat hunting	96-97%	20/week

Total tuning investment: 920 hours (23 weeks of dedicated analyst time).

Implementation Cost Breakdown:

Component	Cost	Notes
Hardware (4 nodes)	$74,000	Intel Xeon, 40G NICs, NVMe storage
Network TAPs	$28,000	Fiber TAPs for traffic mirroring
ETPRO License (4 sensors)	$4,800/year	Commercial ruleset subscription
Implementation Services	$85,000	Architecture design, deployment, tuning
Training	$18,000	3 analysts, 2-week intensive training
SIEM Integration	$45,000	Elasticsearch cluster, Kibana dashboards
Year 1 Operations	$180,000	1.5 FTE security analysts
Total Year 1	$434,800	Full implementation + operations
Ongoing Annual	$184,800	Operations + licensing

Zeek (formerly Bro): Protocol Analysis and Network Security Monitoring

Zeek takes a fundamentally different approach from signature-based IDS: instead of pattern matching, it performs deep protocol analysis and generates structured logs describing network behavior, enabling behavioral detection and threat hunting.

Zeek Philosophy: Rather than "Does this traffic match a known bad pattern?", Zeek asks "What is this traffic doing?" and provides analysts with comprehensive visibility to detect both known and unknown threats.

Core Capabilities:

Feature	Capability	Output Format	Use Case
Protocol Parsing	Deep analysis of 50+ protocols	Structured logs (TSV/JSON)	Protocol anomaly detection
Connection Logging	Complete network connection metadata	conn.log (src, dst, bytes, duration)	Baseline establishment, anomaly detection
File Extraction	Automatic file extraction from protocols	Files + metadata (hashes, MIME types)	Malware analysis
SSL/TLS Analysis	Certificate validation, JA3 fingerprinting	ssl.log, x509.log	C2 detection, certificate anomalies
HTTP Logging	Complete HTTP transaction details	http.log (methods, URIs, user-agents)	Web attack detection
DNS Logging	DNS query/response analysis	dns.log	Tunneling, DGA detection
SMTP Analysis	Email metadata extraction	smtp.log	Phishing detection
SMB/CIFS Monitoring	File share activity tracking	smb_files.log, smb_mapping.log	Lateral movement detection
Custom Scripting	Zeek scripting language	Any output format	Organization-specific detection
Intelligence Framework	Indicator matching (IP, domain, hash)	intel.log	Threat intelligence integration

Zeek vs. Signature-Based IDS: Complementary Approaches

Dimension	Suricata (Signature-Based)	Zeek (Behavioral Analysis)	Optimal Strategy
Detection Method	Pattern matching against known signatures	Analyze behavior, detect anomalies	Use both together
Known Threats	Excellent (immediate detection)	Good (requires rules/scripts)	Suricata primary
Unknown Threats	Limited (requires signature)	Excellent (behavioral anomalies visible)	Zeek primary
False Positives	Moderate-High (signature tuning required)	Low (behavior-based)	Zeek better
Alert Volume	High (every signature match)	Low-Moderate (scripted alerts only)	Zeek better
Threat Hunting	Limited (signature-centric)	Excellent (rich log data)	Zeek better
Real-Time Blocking	Native IPS support	No blocking (passive only)	Suricata only
Learning Curve	Medium	High (scripting language required)	Suricata easier
Resource Usage	Moderate (multi-threaded)	High (single-threaded per worker)	Similar at scale

Zeek Implementation for Healthcare Organization:

After the 47 TB breach, we deployed Zeek alongside Suricata for comprehensive coverage:

Network TAP ↓ [Traffic Load Balancer] ├─→ Suricata Cluster (signature-based detection, blocking) └─→ Zeek Cluster (behavioral analysis, logging) ├─ Manager Node (orchestration) ├─ Proxy Node (log aggregation) └─ Worker Nodes (4x traffic analysis)

Zeek Cluster Configuration:

Node Type	Quantity	Role	Hardware Specs	Cost Per Node
Manager	1	Cluster coordination, configuration	8 cores, 32 GB RAM, 2 TB SSD	$4,500
Proxy	1	Log aggregation, load balancing	16 cores, 64 GB RAM, 8 TB SSD	$8,500
Worker	4	Packet processing, protocol analysis	24 cores, 128 GB RAM, 4 TB SSD	$12,500
Total	6	Full cluster	- -	$68,500

Critical Detection Capabilities (addressing the original breach):

Data Exfiltration Detection:

# Detect large sustained outbound transfers
event connection_state_remove(c: connection) {
    if (c$orig$bytes > 1000000000 && # > 1 GB transferred
        c$duration > 3600 && # Connection > 1 hour
        !is_local_addr(c$id$resp_h)) { # To external destination
        
        NOTICE([
            $note=DataExfil::LargeTransfer,
            $msg=fmt("Large data transfer: %s sent %.2f GB to %s over %s",
                     c$id$orig_h, c$orig$bytes/1e9, c$id$resp_h, duration_to_string(c$duration)),
            $src=c$id$orig_h,
            $dst=c$id$resp_h,
            $identifier=cat(c$id$orig_h, c$id$resp_h)
        ]);
    }
}

Database Access Pattern Anomalies:

# Detect unusual database query patterns
global db_baseline: table[addr] of count;

event mysql_command_request(c: connection, command: count, arg: string) {
    if (command == 3) { # Query command
        local src = c$id$orig_h;
        
        if (src !in db_baseline)
            db_baseline[src] = 1;
        else
            db_baseline[src] += 1;
        
        # Alert if host exceeds normal query rate by 5x
        if (db_baseline[src] > 5000) { # Baseline: ~1000 queries/hour
            NOTICE([
                $note=Database::AnomalousActivity,
                $msg=fmt("Host %s made %d database queries (baseline: 1000/hr)", 
                         src, db_baseline[src]),
                $src=src
            ]);
        }
    }
}

C2 Communication Detection:

# Detect beaconing behavior (regular connections to external host)
global beacon_tracker: table[addr, addr] of vector of time;

Loading advertisement...

event connection_established(c: connection) {
    if (!is_local_addr(c$id$resp_h)) { # External destination
        local key = [c$id$orig_h, c$id$resp_h];
        
        if (key !in beacon_tracker)
            beacon_tracker[key] = vector();
        
        beacon_tracker[key] += network_time();
        
        # Check for regular intervals (beaconing)
        if (|beacon_tracker[key]| >= 10) {
            local intervals: vector of interval;
            for (i in 1..|beacon_tracker[key]|-1) {
                intervals += beacon_tracker[key][i] - beacon_tracker[key][i-1];
            }
            
            # Calculate standard deviation of intervals
            # Low stddev = regular beaconing = potential C2
            if (calculate_stddev(intervals) < 30sec) {
                NOTICE([
                    $note=C2::BeaconDetected,
                    $msg=fmt("Beaconing detected: %s -> %s (interval: ~%s)",
                             c$id$orig_h, c$id$resp_h, intervals[0]),
                    $src=c$id$orig_h,
                    $dst=c$id$resp_h
                ]);
            }
        }
    }
}

These three detection scripts would have identified the original breach:

Day 3: Database query anomaly detected (compromised account accessing patient records 8x normal rate)
Day 4: Large transfer detection triggered (1.2 GB exfiltrated in single connection)
Day 7: Beaconing detection identified C2 communication (regular 15-minute check-ins to external IP)

Instead of 183 days undetected, breach would have been detected within 72 hours.

Zeek Log Output Volume and Management:

Log Type	Daily Volume (GB)	Retention Period	Storage Required (1 year)	Key Use Cases
conn.log	45 GB	90 days	4.1 TB	Connection baselines, anomaly detection
http.log	28 GB	60 days	1.7 TB	Web attack detection, user activity
dns.log	18 GB	60 days	1.1 TB	Tunneling, DGA, malware domains
ssl.log	12 GB	60 days	720 GB	Certificate anomalies, TLS fingerprinting
files.log	8 GB	30 days	240 GB	File extraction, malware analysis
smtp.log	6 GB	90 days	540 GB	Phishing detection, data exfiltration
smb_files.log	15 GB	60 days	900 GB	Lateral movement, ransomware
x509.log	3 GB	60 days	180 GB	Certificate validation, MitM detection
weird.log	2 GB	60 days	120 GB	Protocol anomalies, reconnaissance
notice.log	1 GB	365 days	365 GB	Alert history, incident investigation
Total	138 GB/day	Varies	10.1 TB	Complete network visibility

Storage architecture: Elasticsearch cluster (hot/warm/cold tiers), 30-day hot storage on NVMe, 60-day warm on SSD, 1-year cold on HDD.

OSSEC / Wazuh: Host-Based Intrusion Detection

While Suricata and Zeek monitor network traffic, OSSEC and Wazuh (fork of OSSEC with extended capabilities) provide host-based intrusion detection, monitoring individual systems for compromise indicators.

Host-Based vs. Network-Based Detection:

Detection Type	Visibility	Detection Scope	Deployment Complexity	Evasion Difficulty
Network-Based (Suricata/Zeek)	Network traffic only	Cross-host attacks, lateral movement	Low (centralized sensors)	Moderate (encryption defeats)
Host-Based (OSSEC/Wazuh)	System internals, logs, files	Local compromise, privilege escalation	High (agents on every host)	Very High (attacker must compromise agent)
Optimal Strategy	Deploy Both	Complete Coverage	Justified by Security	Defense in Depth

Wazuh Capabilities:

Capability	Detection Method	Use Case	MITRE ATT&CK Coverage
Log Analysis	Parse system/application logs, match patterns	Authentication failures, privilege escalation	TA0001 (Initial Access), TA0004 (Privilege Escalation)
File Integrity Monitoring	Hash-based change detection	Unauthorized file modifications, rootkit detection	TA0003 (Persistence), TA0005 (Defense Evasion)
Rootkit Detection	Signature + behavior checks	Hidden processes, kernel modules	TA0005 (Defense Evasion)
Vulnerability Detection	CVE scanning, patch level assessment	Unpatched systems, misconfigurations	TA0001 (Initial Access)
Active Response	Automated blocking, quarantine	Stop attacks in progress	TA0040 (Impact) prevention
Compliance Monitoring	Policy enforcement (PCI DSS, HIPAA)	Regulatory compliance, audit preparation	Compliance requirements
Cloud Security	AWS/Azure/GCP monitoring	Cloud infrastructure security	Cloud-specific TTPs
Container Security	Docker/Kubernetes monitoring	Container escape, misconfiguration	Container attack techniques

Wazuh Implementation Architecture:

For the healthcare organization (850 servers, 2,400 workstations):

[Wazuh Agents - 3,250 hosts] ↓ [Wazuh Server Cluster - 3 nodes] ├─ Master Node (orchestration, API) ├─ Worker Node 1 (agent management, analysis) └─ Worker Node 2 (agent management, analysis) ↓ [Elasticsearch Cluster - 5 nodes] ├─ Master Node (cluster management) ├─ Data Node 1 (hot tier - 30 days) ├─ Data Node 2 (hot tier - 30 days) ├─ Data Node 3 (warm tier - 90 days) └─ Data Node 4 (warm tier - 90 days) ↓ [Kibana - Visualization/Dashboards]

Agent Deployment Statistics:

System Type	Agent Count	Events Per Second	Detection Examples
Windows Servers	320	1,850	Failed RDP, PowerShell abuse, service changes
Linux Servers	530	2,400	SSH brute force, root escalation, cron modifications
Windows Workstations	2,400	8,500	Malware execution, USB device insertion, registry changes
Cloud Instances (AWS)	180	920	IAM changes, security group modifications, S3 access
Containers (Docker)	420	1,100	Container escape attempts, volume mounts
Network Appliances	80	380	Configuration changes, authentication failures
Total	3,930	15,150 EPS	Comprehensive host visibility

Critical Detection Rules (addressing original breach):

1. Lateral Movement Detection:

<rule id="100001" level="12">
  <if_sid>5715</if_sid> <!-- Windows security event: account logon -->
  <hostname>database-server</hostname>
  <match>user:!db-admin</match>
  <description>Non-admin user logged into database server</description>
  <group>lateral_movement,database</group>
  <mitre>
    <id>T1021</id> <!-- Remote Services -->
    <id>T1078</id> <!-- Valid Accounts -->
  </mitre>
</rule>

2. Data Staging Detection:

<rule id="100002" level="10">
  <if_sid>550</if_sid> <!-- File integrity monitoring -->
  <match>^/tmp/.+\.zip$</match>
  <description>Large compressed file created in temp directory</description>
  <group>data_exfiltration,file_monitoring</group>
  <mitre>
    <id>T1560</id> <!-- Archive Collected Data -->
    <id>T1074</id> <!-- Data Staged -->
  </mitre>
</rule>

3. Persistence Mechanism Detection:

<rule id="100003" level="12">
  <if_sid>61617</if_sid> <!-- Windows registry monitoring -->
  <match>HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Windows\CurrentVersion\Run</match>
  <description>Startup registry key modified</description>
  <group>persistence,registry</group>
  <mitre>
    <id>T1547.001</id> <!-- Registry Run Keys -->
  </mitre>
</rule>

File Integrity Monitoring Configuration:

<!-- Monitor critical system directories -->
<syscheck>
  <frequency>3600</frequency> <!-- Check hourly -->
  
  <!-- Windows critical directories -->
  <directories check_all="yes" realtime="yes">C:\Windows\System32</directories>
  <directories check_all="yes" realtime="yes">C:\Program Files</directories>
  <directories check_all="yes" realtime="yes">C:\Users</directories>
  
  <!-- Linux critical directories -->
  <directories check_all="yes" realtime="yes">/bin,/sbin,/usr/bin,/usr/sbin</directories>
  <directories check_all="yes" realtime="yes">/etc</directories>
  <directories check_all="yes" realtime="yes">/root</directories>
  
  <!-- Application directories -->
  <directories check_all="yes" realtime="yes">/opt/healthcare-app</directories>
  <directories check_all="yes" realtime="yes">C:\inetpub\wwwroot</directories>
</syscheck>

Performance and Scalability:

Metric	Value	Impact
Agent CPU Usage	0.5-2% per host	Minimal performance impact
Agent Memory Usage	50-150 MB per host	Acceptable overhead
Network Bandwidth	10-50 KB/s per agent	Negligible network load
Event Processing Capacity	50,000 EPS per Wazuh node	Supports 3,300+ agents per node
Alert Generation	2,500-4,000 alerts/day	After tuning (was 28,000/day initially)
False Positive Rate	3.8%	After 4-month tuning period
True Positive Rate	91.2%	Validated against red team exercises

Implementation Cost:

Component	Cost	Notes
Wazuh Server Cluster (3 nodes)	$28,500	32 cores, 128 GB RAM, 2 TB SSD each
Elasticsearch Cluster (5 nodes)	$67,500	Data nodes: 64 cores, 256 GB RAM, 20 TB storage
Kibana Server	$8,500	16 cores, 64 GB RAM
Implementation Services	$95,000	Agent deployment, rule tuning, integration
Training	$22,000	5 analysts, custom rule development
Year 1 Operations	$140,000	1 FTE security engineer
Total Year 1	$361,500	Complete HIDS deployment
Ongoing Annual	$140,000	Operations only (no licensing)

Security Onion: Integrated Network Security Monitoring Platform

Security Onion packages multiple open source security tools into cohesive, pre-integrated platform, significantly reducing deployment complexity.

Integrated Components:

Component	Function	Integration Benefit
Suricata	IDS/IPS engine	Pre-configured with ETPRO rules
Zeek	Network security monitor	Automatic log forwarding to SIEM
Wazuh	Host-based IDS	Centralized agent management
Elasticsearch	SIEM backend	Optimized for security data
Kibana	Visualization	Security-focused dashboards
Stenographer	Full packet capture	Integrated PCAP retrieval
osquery	Endpoint visibility	Real-time fleet queries
Fleet	Osquery management	Centralized query deployment
TheHive	Incident response platform	Case management integration
Cortex	Analysis/response automation	Automated artifact analysis
CyberChef	Data analysis	Integrated decoding/analysis
PlayBook	Investigation workflow	Guided response procedures

Security Onion Deployment Modes:

Mode	Architecture	Use Case	Cost
Standalone	Single node, all services	Small networks (<1 Gbps), POC	$8,500 - $15,000
Distributed	Manager + sensors + search nodes	Enterprise networks (>1 Gbps)	$85,000 - $450,000
Cloud	Cloud-native deployment	Cloud infrastructure monitoring	Variable
Hybrid	On-premises + cloud sensors	Hybrid environment coverage	$125,000 - $680,000

Reference Architecture (Mid-Size Enterprise: 2,500 hosts, 5 Gbps peak):

[Security Onion Manager] ├─ Web Interface (SOC access) ├─ Central Configuration └─ Orchestration ↓ [Search Nodes - Elasticsearch Cluster (3 nodes)] ├─ 30-day hot storage ├─ 90-day warm storage └─ 365-day cold storage ↓ [Forward Nodes - Sensor Deployment (4 locations)] ├─ Location 1: Data Center (2 Gbps) │ ├─ Suricata (IDS mode) │ ├─ Zeek (passive monitoring) │ └─ Stenographer (full PCAP) ├─ Location 2: Corporate HQ (1.5 Gbps) ├─ Location 3: Branch Office 1 (800 Mbps) └─ Location 4: Branch Office 2 (700 Mbps) ↓ [Heavy Nodes - Storage/Analysis (2 nodes)] ├─ Long-term PCAP storage └─ Distributed processing

Hardware Specifications:

Node Type	CPU	RAM	Storage	Network	Cost	Quantity
Manager	8 cores	32 GB	1 TB SSD	10 Gbps	$6,500	1
Search Node	32 cores	128 GB	12 TB SSD	10 Gbps	$22,500	3
Forward Node	24 cores	64 GB	4 TB SSD	40 Gbps	$15,500	4
Heavy Node	48 cores	256 GB	80 TB HDD	10 Gbps	$35,000	2
Total	- -	- -	184 TB	- -	$228,000	10

Deployment Advantages:

Benefit	Description	Time Savings	Cost Savings
Pre-Integration	All components work together out-of-box	4-8 weeks	$85,000 - $185,000
Automated Setup	Installation scripts handle configuration	2-3 weeks	$35,000 - $85,000
Unified Management	Single interface for all tools	Ongoing	0.5 FTE/year ($65,000)
Curated Rulesets	Pre-tuned detection rules	6-12 weeks	$95,000 - $225,000
Security Dashboards	Pre-built visualization templates	2-4 weeks	$25,000 - $65,000
Documentation	Comprehensive deployment guides	1-2 weeks	$15,000 - $35,000

However, Security Onion's opinionated architecture limits customization—excellent for organizations wanting proven reference architecture, less suitable for those requiring highly customized solutions.

Detection Methodology: Signatures, Anomalies, and Behavioral Analysis

Effective IDS/IPS requires understanding different detection approaches and when each applies.

Detection Method Comparison

Detection Method	How It Works	Strengths	Weaknesses	Implementation Complexity	False Positive Rate	False Negative Rate
Signature-Based	Match traffic against known attack patterns	Excellent for known threats, low false positives	Cannot detect unknown threats, requires constant updates	Low	5-15% (well-tuned)	30-50% (zero-days)
Anomaly-Based	Statistical analysis of normal behavior	Detects unknown threats, adapts to environment	High false positives during learning	High	40-70% (initial), 15-25% (mature)	10-20%
Heuristic-Based	Rules for suspicious behavior patterns	Balances known/unknown detection	Requires domain expertise	Medium	20-35%	15-25%
Behavioral Analysis	Monitor entity behavior over time	Insider threats, APT detection	Resource intensive, long detection windows	Very High	10-20%	5-15%
Protocol Analysis	Deep inspection against RFC standards	Protocol abuse, evasion techniques	Limited to understood protocols	Medium-High	5-10%	20-30%
Reputation-Based	Check against known-bad IP/domain lists	Fast, low overhead	Only detects known infrastructure	Low	3-8%	40-60%

Optimal Strategy: Layer multiple detection methods:

Signature-based (Suricata): Front-line defense against known threats
Protocol analysis (Zeek): Detect protocol abuse and evasion
Behavioral analysis (Zeek scripts): Detect anomalies and APT activity
Host-based (Wazuh): Catch what network monitoring misses

"Single detection methods create blind spots. Signature systems miss zero-days. Anomaly systems drown analysts in false positives. Protocol analysis misses encrypted traffic. The solution isn't choosing the 'best' method—it's layering complementary methods that cover each other's gaps."

Signature Development and Tuning

Signature-based detection remains most mature and reliable method, but requires continuous maintenance.

Signature Lifecycle:

Phase	Activity	Timeframe	Responsibility	Output
Discovery	New threat identified in wild	Day 0	Security community	Threat indicators
Analysis	Malware/attack technique reverse engineered	Days 1-3	Threat researchers	Technical analysis
Signature Creation	Detection rule written and tested	Days 2-5	Rule developers	Draft signature
Testing	Validate detection, measure false positives	Days 4-7	QA team	Tuned signature
Release	Signature distributed via ruleset	Days 5-10	Ruleset maintainers	Production rule
Deployment	Organizations download and deploy	Days 7-14	SOC teams	Active detection
Tuning	Adjust for environment-specific false positives	Weeks 2-8	SOC analysts	Optimized rule
Maintenance	Update as threat evolves	Ongoing	Ruleset maintainers	Updated rules

Detection Gap: Days 0-14 represent vulnerability window before signature deployment provides protection. This gap motivates behavioral and anomaly detection approaches.

Custom Signature Development Example:

During the healthcare breach investigation, we identified attacker's data exfiltration technique: compress patient database exports into password-protected ZIP files, upload to compromised cloud storage account using legitimate cloud sync client.

Development Process:

Threat Analysis:
- Attacker used WinRAR command-line to create archives
- Archives named with pattern: backup_YYYYMMDD_HHMM.zip
- Archives uploaded to OneDrive using legitimate sync client
- Average archive size: 8-15 GB
- Upload occurred between 2 AM - 4 AM (off-hours)
Signature Design (multiple layers):

Layer 1: Process Execution Detection (Wazuh):

<rule id="100010" level="10">
  <if_sid>61603</if_sid> <!-- Process creation -->
  <match>WinRAR.exe.+-hp.+backup_</match>
  <description>WinRAR creating password-protected backup archive</description>
  <group>data_exfiltration</group>
</rule>

Layer 2: Large File Upload Detection (Zeek):

event file_over_new_connection(f: fa_file, c: connection, is_orig: bool) {
    if (f$source == "HTTP" && 
        f$total_bytes > 8000000000 && # > 8 GB
        /graph\.microsoft\.com/ in c$http$host && # OneDrive API
        c$start_time$hour >= 2 && c$start_time$hour <= 4) { # Off-hours
        
        NOTICE([
            $note=DataExfil::LargeCloudUpload,
            $msg=fmt("Large file upload to OneDrive: %.2f GB at %s", 
                     f$total_bytes/1e9, c$start_time),
            $src=c$id$orig_h
        ]);
    }
}

Layer 3: Network Signature (Suricata):

alert http $HOME_NET any -> $EXTERNAL_NET any (
    msg:"CUSTOM Large OneDrive Upload During Off-Hours";
    flow:established,to_server;
    http.method; content:"PUT";
    http.host; content:"graph.microsoft.com";
    http.uri; content:"/me/drive/items/";
    filesize:>8388608000; # > 8 GB
    threshold: type limit, track by_src, count 1, seconds 7200;
    classtype:policy-violation;
    sid:1000010;
    rev:1;
)

Testing:
- False Positive Testing: 30-day retrospective analysis against legitimate backups
- True Positive Validation: Replay attacker traffic against signatures
- Performance Testing: Measure CPU/memory impact on sensors
Results:
- Detection Rate: 100% (3/3 signature layers detected test exfiltration)
- False Positives: 2 in 30 days (legitimate large backups during maintenance windows)
- Detection Latency: 18 seconds (process execution) to 4.3 minutes (network detection)
- Performance Impact: <0.1% CPU increase per sensor
Deployment:
- Deployed to production: 72 hours after attack discovery
- Added to custom ruleset: Available for future incidents
- Shared with industry ISACs: Distributed to peer healthcare organizations

This multi-layered approach ensured detection even if attacker modified any single aspect of technique.

Compliance and Regulatory Framework Mapping

IDS/IPS systems satisfy multiple compliance requirements across frameworks.

Comprehensive Compliance Mapping

Control Requirement	PCI DSS	HIPAA	SOC 2	ISO 27001	NIST 800-53	GDPR	IDS/IPS Implementation
Network Monitoring	Req 10.6, 11.4	§164.312(b)	CC7.2, CC7.3	A.12.4.1, A.13.1.1	SI-4, SC-7	Art 32	Deploy network IDS/IPS at perimeter and internal segments
Log Retention	Req 10.5	§164.316(b)(2)	CC7.2	A.12.4.1	AU-11	Art 5(1)(e)	Retain IDS/IPS logs minimum 90 days (PCI: 1 year)
Anomaly Detection	Req 10.6	§164.312(b)	CC7.2	A.12.4.1	SI-4(5)	Art 32(1)(d)	Enable behavioral analysis, establish baselines
Intrusion Prevention	Req 11.4	Implicit	CC7.2	A.13.1.1	SI-4(4)	Art 32(1)(b)	Deploy IPS at critical boundaries
File Integrity Monitoring	Req 11.5	Implicit	CC7.1	A.12.4.1	SI-7	Art 32(1)(b)	Deploy host-based FIM (OSSEC/Wazuh)
Security Incident Detection	Req 12.10	§164.308(a)(6)	CC7.3	A.16.1.2	IR-4	Art 33	Automated alerting, 24/7 monitoring
Vulnerability Detection	Req 11.2	§164.308(a)(8)	CC7.1	A.12.6.1	RA-5	Art 32(1)(d)	Integrate vulnerability scanning with IDS alerts
Encrypted Traffic Inspection	Req 11.4	§164.312(e)(1)	CC6.7	A.10.1.1	SC-8, SC-13	Art 32(1)(a)	SSL/TLS decryption for inspection (where legally permitted)

PCI DSS Requirements:

PCI DSS explicitly mandates IDS/IPS deployment:

Requirement 11.4: "Use intrusion-detection and/or intrusion-prevention techniques to detect and/or prevent intrusions into the network"
- Implementation: Deploy IDS/IPS at perimeter, between CDE and untrusted networks
- Testing: Quarterly IDS/IPS validation, annual penetration testing
Requirement 10.6: "Review logs and security events for all system components to identify anomalies or suspicious activity"
- Implementation: Automated log analysis, correlation, alerting
- Frequency: Daily review of IDS/IPS alerts

Implementation for Payment Processing Environment:

Component	Deployment	PCI DSS Requirement	Annual Cost
Perimeter IDS/IPS (Suricata)	Outside firewall, inline mode	Req 11.4	$45,000
CDE Boundary IDS (Suricata)	Between CDE and corporate network	Req 11.4	$28,000
HIDS on CDE Servers (Wazuh)	All servers processing/storing cardholder data	Req 11.5	$18,000
SIEM Integration (ELK)	Centralized log collection, correlation	Req 10.6	$85,000
24/7 Monitoring	SOC staffing, alert response	Req 10.6	$420,000
Quarterly Validation	External testing, compliance reporting	Req 11.4	$35,000
Total Compliance Cost	Complete PCI DSS IDS/IPS Program	Multiple Requirements	$631,000

HIPAA Requirements:

HIPAA Security Rule requires safeguards to protect ePHI (electronic Protected Health Information):

§164.312(b) - Audit Controls: "Implement hardware, software, and/or procedural mechanisms that record and examine activity in information systems that contain or use electronic protected health information"
- Implementation: Network and host-based IDS logging all access to ePHI
§164.308(a)(6)(ii) - Security Incident Procedures: "Identify and respond to suspected or known security incidents"
- Implementation: Automated IDS alerting, incident response procedures

Healthcare Implementation (post-breach):

HIPAA Requirement	Control Implementation	Detection Capability	Cost
Access Monitoring (§164.312(b))	Wazuh monitoring all ePHI database access	Unauthorized access detection	$45,000
Network Monitoring (§164.312(b))	Suricata/Zeek monitoring all network segments	Exfiltration detection	$95,000
Encryption Verification (§164.312(e))	Zeek SSL/TLS validation	Unencrypted ePHI transmission detection	Included
Incident Detection (§164.308(a)(6))	SIEM correlation, automated alerting	Real-time incident identification	$125,000
Audit Trails (§164.312(b))	Log retention 6 years per HIPAA	Forensic investigation capability	$65,000
Total HIPAA Compliance	Comprehensive ePHI Protection	- -	$330,000

ROI of Compliance-Driven IDS/IPS Investment

Compliance Framework	Annual IDS/IPS Investment	Non-Compliance Penalty Range	Breach Cost Reduction	Net ROI
PCI DSS	$631,000	$5K - $100K/month + card revocation	$3.2M - $8.9M	407% - 1,310%
HIPAA	$330,000	$100 - $50K per violation ($1.5M annual max)	$2.8M - $6.4M	748% - 1,839%
SOC 2	$285,000	Loss of certification, customer churn	$4.5M - $12M (customer retention)	1,479% - 4,111%
GDPR	$420,000	Up to €20M or 4% revenue	$5.8M - $18M	1,281% - 4,186%

These calculations demonstrate that compliance-driven IDS/IPS investment pays for itself through breach prevention alone, before considering penalty avoidance and certification value.

Implementation Methodology: From Planning to Production

Successful IDS/IPS deployment requires structured methodology addressing technical, operational, and organizational dimensions.

Implementation Phases and Timeline

Phase	Duration	Key Activities	Deliverables	Team Requirements	Cost Range
Assessment	2-4 weeks	Network mapping, traffic analysis, threat modeling	Network diagrams, traffic baselines, requirements	1 architect + 1 analyst	$25K - $65K
Design	2-3 weeks	Solution selection, architecture design, HA planning	Architecture document, deployment plan	1 architect + 1 engineer	$35K - $85K
Procurement	2-6 weeks	Hardware acquisition, software licensing	Equipment, licenses	1 procurement specialist	$50K - $500K
Deployment	3-6 weeks	Installation, configuration, integration	Operational sensors, integrated SIEM	2 engineers	$65K - $185K
Tuning	8-16 weeks	Rule optimization, false positive reduction	Optimized rulesets, baselines	2-3 analysts	$95K - $285K
Documentation	2-3 weeks	Operational procedures, runbooks	SOPs, runbooks, training materials	1 technical writer	$18K - $45K
Training	1-2 weeks	Operator training, incident response drills	Trained staff, validated procedures	External trainers	$22K - $65K
Handoff	1 week	Transition to operations, ongoing support plan	Support agreement, escalation procedures	Full team	$8K - $18K
Total	5-9 months	Complete implementation	Production IDS/IPS	Varies	$318K - $1.248M

Critical Success Factors:

Executive Sponsorship: IDS/IPS requires sustained investment through tuning phase
Dedicated Resources: Part-time staffing fails during tuning phase
Realistic Expectations: Initial false positive rates will be high (40-60%)
Iterative Approach: Tune aggressively first 90 days, then refine continuously
Integration Focus: Standalone IDS provides limited value; SIEM integration essential

Network Architecture and Sensor Placement

Optimal sensor placement maximizes detection coverage while managing costs:

Strategic Placement Zones:

Zone	Monitoring Objective	Sensor Type	Deployment Mode	Typical Traffic Volume	Hardware Cost
Internet Perimeter	External attack detection, DDoS	Network IDS/IPS	Inline (IPS) or TAP (IDS)	10-100 Gbps	$25K - $85K
DMZ	Web application attacks, server compromises	Network IDS + HIDS	TAP + agents	5-50 Gbps	$18K - $65K
Internal Network Core	Lateral movement, insider threats	Network IDS	TAP or SPAN	20-200 Gbps	$35K - $125K
Database Network	Unauthorized access, data exfiltration	Network IDS + HIDS	TAP + agents	2-20 Gbps	$15K - $45K
Management Network	Privileged access abuse	Network IDS + HIDS	TAP + agents	100 Mbps - 2 Gbps	$8K - $25K
Remote Office VPN	Remote access threats	Network IDS	Virtual TAP	500 Mbps - 5 Gbps	$5K - $18K
Cloud Infrastructure	Cloud-specific attacks, misconfigurations	Virtual IDS + cloud APIs	VPC traffic mirroring	Variable	$12K - $65K
Endpoints	Host compromise, malware execution	HIDS	Agents on all systems	N/A (local processing)	$3-8 per endpoint

Reference Architecture (Enterprise: 5,000 employees, 3 data centers, cloud presence):

[Internet] ↓ ┌───────────────┼───────────────┐ ↓ ↓ ↓ [DC1 Perimeter] [DC2 Perimeter] [DC3 Perimeter] └─ IPS (Inline) └─ IPS (Inline) └─ IPS (Inline) ↓ ↓ ↓ [Firewall] [Firewall] [Firewall] ↓ ↓ ↓ [DMZ] [DMZ] [DMZ] └─ IDS (TAP) └─ IDS (TAP) └─ IDS (TAP) ↓ ↓ ↓ [Internal FW] [Internal FW] [Internal FW] ↓ ↓ ↓ [Core Network] [Core Network] [Core Network] └─ IDS (SPAN) └─ IDS (SPAN) └─ IDS (SPAN) ↓ ↓ ↓ ┌───┴────┬─────┬────┴────┬───────┴────┐ ↓ ↓ ↓ ↓ ↓ [Servers] [DB] [VMs] [Storage] [Mgmt] └─HIDS └─HIDS └─HIDS └─IDS(TAP) └─HIDS+IDS [Cloud Infrastructure - AWS/Azure] └─ VPC Traffic Mirroring → Virtual IDS └─ CloudWatch/Azure Monitor → SIEM Integration └─ GuardDuty/Defender → Native Threat Detection

[Remote Offices - 15 locations]
└─ VPN Concentrators → Virtual TAP → IDS

[Endpoints - 5,000 workstations]
└─ HIDS Agents (Wazuh)

Loading advertisement...

[Centralized SIEM]
├─ All sensors feed to Elasticsearch cluster
├─ Correlation rules across all sources
└─ SOC dashboard/alerting

Sensor Quantity and Coverage:

Location	IDS Sensors	IPS Sensors	HIDS Agents	Total Monitoring Points
DC1 (Primary)	4	2	850	856
DC2 (Secondary)	3	1	620	624
DC3 (DR)	2	1	340	343
Cloud (AWS)	6 virtual	0	480	486
Cloud (Azure)	4 virtual	0	280	284
Remote Offices	15 virtual	0	850	865
Endpoints	0	0	5,000	5,000
Total	34	4	8,420	8,458

Hardware Investment:

Component	Quantity	Unit Cost	Total Cost
Physical IDS Sensors (high-performance)	9	$35,000	$315,000
Physical IPS Sensors (inline)	4	$45,000	$180,000
Virtual IDS Licenses	25	$8,000	$200,000
HIDS Agent Licensing	8,420	$0 (Wazuh)	$0
Network TAPs	18	$3,500	$63,000
SIEM Infrastructure	1 cluster	$285,000	$285,000
Total Initial Investment	- -	- -	$1,043,000

Tuning Methodology: Reducing False Positives

Initial IDS/IPS deployments generate overwhelming alert volumes. Systematic tuning reduces noise while maintaining detection accuracy.

Tuning Process Framework:

Stage	Objective	Method	Duration	Expected FP Reduction	Effort (Hours)
Week 1-2: Baseline	Establish normal traffic patterns	Alert-only mode, no blocking	2 weeks	0% (baseline measurement)	80
Week 3-4: Obvious Noise	Disable clearly irrelevant rules	Rule analysis, documentation review	2 weeks	35-45%	120
Week 5-8: Threshold Adjustment	Tune detection thresholds	Statistical analysis, operator feedback	4 weeks	60-75%	200
Week 9-12: Whitelist Development	Exclude known-good traffic	Traffic analysis, business validation	4 weeks	80-90%	240
Month 4-6: Custom Rules	Develop environment-specific detections	Threat modeling, testing	3 months	92-96%	360
Ongoing: Continuous Refinement	Maintain accuracy as environment evolves	Weekly reviews, quarterly audits	Continuous	96-98% (mature)	20/week

Week 1-2: Baseline Measurement:

Deploy all sensors in alert-only mode (no blocking). Measure baseline alert volume:

Sensor	Daily Alerts	Alert Types	Top 5 Alerts (by volume)
Perimeter IDS	28,400	847 unique	Port scans, SSH brute force, TLS anomalies, SQL injection attempts, DNS queries
Internal IDS	42,800	1,240 unique	SMB protocol violations, HTTP policy violations, DNS queries, Certificate warnings, Port scans
DMZ IDS	15,600	620 unique	Web attacks, TLS issues, HTTP policy, Port scans, Certificate issues
HIDS (all agents)	156,000	2,100 unique	Failed logins, File changes, Registry modifications, Process executions, Service changes
Total	242,800/day	- -	Overwhelming volume, requires aggressive tuning

At this volume, security team cannot effectively respond. Assuming 5 analysts, 8-hour shifts:

Available analysis time: 40 hours/day = 2,400 minutes/day
Alerts per minute: 242,800 / 1,440 minutes = 168 alerts/minute
Seconds per alert: 0.36 seconds

Conclusion: Impossible to effectively analyze. Tuning mandatory.

Week 3-4: Disable Obvious Noise:

Analyze top 20 alerts by volume. Identify clearly false positives:

Alert	Daily Count	Analysis	Action
ET POLICY HTTP Request to a *.top domain	18,500	Legitimate traffic to partner site using .top TLD	Whitelist partner domain
SURICATA TLS invalid certificate	12,400	Internal applications using self-signed certificates	Whitelist internal certificate authorities
ET POLICY Outbound HTTPS Session	9,800	Normal user web browsing	Disable (too broad for environment)
OSSEC Windows Registry modified	8,200	Legitimate software installations/updates	Increase threshold from 1 to 10 changes/hour
ET SCAN Potential Port Scan	7,600	Vulnerability scanner false positives	Whitelist internal security scan IPs

Results: Daily alerts reduced from 242,800 to 156,300 (35.6% reduction).

Week 5-8: Threshold Adjustment:

For remaining high-volume alerts, adjust thresholds to reduce noise while maintaining detection:

Rule	Original Threshold	False Positives/Day	Adjusted Threshold	New FP/Day	Detection Impact
Failed SSH Login	1 attempt	4,200	5 attempts in 10 min	280	Minimal (brute force still detected)
DNS Query Volume	100 queries/min	2,800	500 queries/min	45	Acceptable (DNS tunneling threshold still valid)
HTTP POST Size	10 MB	1,900	50 MB	85	Acceptable (exfiltration detection preserved)
TLS Handshake Anomaly	Any anomaly	3,400	5 anomalies from same source	180	Good (reduces transient network issues)

Results: Daily alerts reduced from 156,300 to 62,400 (60% reduction from baseline).

Week 9-12: Whitelist Development:

Create comprehensive whitelists for known-good traffic:

# Whitelist examples (Suricata pass rules)

# Trusted partner communications
pass ip 10.0.0.0/8 any <> [partner_ips] any (msg:"WHITELIST Trusted Partner Communication"; sid:2000001;)

# Backup traffic
pass tcp any any -> [backup_servers] 9000 (msg:"WHITELIST Backup Traffic"; sid:2000002;)

Loading advertisement...

# Monitoring systems
pass ip [monitoring_ips] any <> any any (msg:"WHITELIST Network Monitoring"; sid:2000003;)

# Update servers
pass tcp any any -> [update_servers] 443 (msg:"WHITELIST Software Updates"; sid:2000004;)

Develop whitelists for:

Trusted business partner IPs (85 partners)
Internal management tools (42 systems)
Backup systems (18 servers)
Software update sources (28 vendors)
Security scanning tools (8 systems)
Network monitoring platforms (12 systems)

Results: Daily alerts reduced from 62,400 to 18,200 (92.5% reduction from baseline).

Month 4-6: Custom Rule Development:

Create custom rules for organization-specific threats and business logic:

# Detect unauthorized access to patient database (healthcare specific)
alert tcp $HOME_NET any -> $DB_SERVERS 3306 (
    msg:"CUSTOM Unauthorized Database Access Attempt";
    flow:to_server,established;
    content:"SELECT * FROM patients";
    threshold: type limit, track by_src, count 1, seconds 3600;
    classtype:attempted-recon;
    sid:3000001;
)

# Detect sensitive data exfiltration patterns
alert http $HOME_NET any -> $EXTERNAL_NET any (
    msg:"CUSTOM Potential PII Exfiltration";
    flow:established,to_server;
    content:"POST"; http_method;
    pcre:"/\b\d{3}-\d{2}-\d{4}\b/"; # SSN pattern
    threshold: type limit, track by_src, count 1, seconds 3600;
    classtype:policy-violation;
    sid:3000002;
)

Developed 127 custom rules specific to healthcare organization's:

Application behaviors
Data sensitivity policies
Compliance requirements
Known legitimate traffic patterns

Final Tuning Results:

Metric	Baseline (Week 1)	After Tuning (Month 6)	Improvement
Daily Alerts	242,800	8,400	96.5% reduction
False Positive Rate	Estimated 85%	3.2%	96.2% improvement
True Positive Detection	Unknown	94.7%	Validated via red team
Analyst Productivity	0.36 sec/alert (impossible)	17 min/alert (thorough)	2,833% improvement
Time to Detection	N/A (alert fatigue)	4.7 minutes average	Measurable capability
Escalation Rate	100% (everything escalated)	8.3% (high-confidence alerts)	91.7% reduction in noise

"IDS/IPS tuning isn't optional optimization—it's the difference between an effective security control and an alert-generating noise machine. Organizations that skip tuning phase invariably end up disabling their IDS/IPS within 6-12 months due to alert fatigue, wasting their entire investment."

Advanced Detection Techniques and Threat Hunting

Mature IDS/IPS implementations extend beyond signatures to advanced detection and proactive threat hunting.

Advanced Detection Patterns

Technique	Detection Focus	Implementation Complexity	False Positive Rate	MITRE ATT&CK Coverage
Traffic Baselining	Deviation from normal patterns	High	15-25%	Multiple tactics
Beaconing Detection	C2 communication patterns	Medium	5-12%	TA0011 (Command and Control)
DNS Tunneling Detection	Data exfiltration via DNS	Medium	8-15%	TA0011 (Command and Control)
DGA Domain Detection	Algorithmically generated domains	High	12-20%	TA0011 (Command and Control)
TLS/SSL Anomalies	Certificate/handshake irregularities	Low	5-10%	TA0009 (Collection), TA0011
Protocol Abuse Detection	Non-standard protocol usage	Medium	10-18%	TA0005 (Defense Evasion)
Lateral Movement Detection	Internal reconnaissance, privilege escalation	High	15-25%	TA0008 (Lateral Movement)
Data Hoarding Detection	Unusual file access patterns	High	20-30%	TA0009 (Collection)

Beaconing Detection Implementation (Zeek):

C2 communications often exhibit regular "beaconing" patterns—periodic connections to attacker infrastructure:

# Advanced beaconing detection with statistical analysis

Loading advertisement...

module Beaconing;

export {
    redef enum Notice::Type += {
        Beacon_Detected,
        Beacon_Suspected
    };
    
    # Track connection times
    global connection_times: table[addr, addr] of vector of time &create_expire=24hrs;
    
    # Beaconing thresholds
    const min_connections = 10;  # Minimum observations
    const max_interval_stddev = 60sec;  # Maximum standard deviation
    const min_interval = 30sec;  # Minimum beacon interval
    const max_interval = 1hr;   # Maximum beacon interval
}

event connection_established(c: connection) {
    # Only analyze external connections
    if (!Site::is_local_addr(c$id$resp_h)) {
        local key = [c$id$orig_h, c$id$resp_h];
        
        if (key !in connection_times)
            connection_times[key] = vector();
        
        connection_times[key] += network_time();
        
        # Once we have enough samples, analyze
        if (|connection_times[key]| >= min_connections) {
            local intervals: vector of interval;
            
            # Calculate intervals between connections
            for (i in 1..|connection_times[key]|-1) {
                local interval = connection_times[key][i] - connection_times[key][i-1];
                intervals += interval;
            }
            
            # Calculate statistics
            local mean = calculate_mean(intervals);
            local stddev = calculate_stddev(intervals);
            
            # Low standard deviation + reasonable interval = beaconing
            if (stddev < max_interval_stddev &&
                mean >= min_interval &&
                mean <= max_interval) {
                
                local confidence = "SUSPECTED";
                local severity = Notice::NOTICE;
                
                # Very low stddev = high confidence
                if (stddev < 10sec) {
                    confidence = "CONFIRMED";
                    severity = Notice::WARNING;
                }
                
                NOTICE([
                    $note=confidence == "CONFIRMED" ? Beacon_Detected : Beacon_Suspected,
                    $msg=fmt("%s beaconing: %s -> %s (interval: %s, stddev: %s, samples: %d)",
                             confidence, c$id$orig_h, c$id$resp_h, mean, stddev, |intervals|),
                    $src=c$id$orig_h,
                    $dst=c$id$resp_h,
                    $identifier=cat(c$id$orig_h, c$id$resp_h),
                    $severity=severity
                ]);
            }
        }
    }
}

Detection Performance (validated against known C2 frameworks):

C2 Framework	Beacon Interval	Detection Time	False Positive Rate	Notes
Cobalt Strike	60 seconds	12-15 minutes	2.1%	Detected reliably
Metasploit	Variable (30-300 sec)	18-45 minutes	4.3%	Longer due to variability
Empire	5-10 minutes	60-90 minutes	1.8%	Longer intervals require more samples
Covenant	30-120 seconds	20-35 minutes	3.2%	Moderate jitter detected
PoshC2	5 seconds - 10 minutes	15-120 minutes	5.7%	High jitter increases FP rate

DNS Tunneling Detection (Zeek):

Attackers use DNS for covert C2 channels and data exfiltration:

# DNS tunneling detection based on multiple indicators

Loading advertisement...

module DNSTunneling;

export {
    redef enum Notice::Type += {
        DNSTunnel_Detected,
        DNSTunnel_Suspected
    };
    
    # Suspicious indicators
    const max_subdomain_length = 50;  # Characters
    const max_subdomain_count = 5;    # Subdomains
    const max_query_rate = 100;       # Queries per minute
    const suspicious_tlds = set(".tk", ".pw", ".cc", ".ga", ".cf");
}

global query_counts: table[addr] of count &create_expire=1min;

Loading advertisement...

event dns_request(c: connection, msg: dns_msg, query: string, qtype: count, qclass: count) {
    local src = c$id$orig_h;
    
    # Track query rate
    if (src !in query_counts)
        query_counts[src] = 0;
    query_counts[src] += 1;
    
    # Parse query
    local parts = split_string(query, /\./);
    local subdomain_length = 0;
    local entropy = 0.0;
    local suspicious_score = 0;
    
    # Check subdomain characteristics
    if (|parts| > max_subdomain_count)
        suspicious_score += 2;
    
    for (i in parts) {
        local part = parts[i];
        local part_len = |part|;
        
        subdomain_length += part_len;
        
        # Calculate entropy (high entropy = random/encoded)
        entropy += calculate_entropy(part);
        
        # Check for suspicious patterns
        if (part_len > 20)  # Long subdomain
            suspicious_score += 1;
        if (part_len > 30)
            suspicious_score += 2;
        if (/^[0-9a-f]{32,}$/ in part)  # Hex-encoded data
            suspicious_score += 3;
        if (/^[A-Za-z0-9+\/]+=*$/ in part && part_len > 20)  # Base64-encoded
            suspicious_score += 3;
    }
    
    # Check TLD
    local tld = parts[|parts|-1];
    if (tld in suspicious_tlds)
        suspicious_score += 2;
    
    # Check total subdomain length
    if (subdomain_length > max_subdomain_length)
        suspicious_score += 2;
    
    # High average entropy indicates encoded data
    entropy = entropy / |parts|;
    if (entropy > 3.5)  # Scale 0-5
        suspicious_score += 2;
    
    # High query rate
    if (query_counts[src] > max_query_rate)
        suspicious_score += 3;
    
    # Alert based on score
    if (suspicious_score >= 6) {
        NOTICE([
            $note=suspicious_score >= 10 ? DNSTunnel_Detected : DNSTunnel_Suspected,
            $msg=fmt("DNS tunneling detected: %s queried %s (score: %d, entropy: %.2f, length: %d)",
                     src, query, suspicious_score, entropy, subdomain_length),
            $src=src,
            $identifier=cat(src, query),
            $sub=query
        ]);
    }
}

Detection Validation (against known DNS tunneling tools):

Tool	Detection Rate	Detection Time	False Positive Sources
dnscat2	98.3%	2-8 minutes	Some CDN domains, dynamic DNS
iodine	96.7%	3-12 minutes	Legitimate long hostnames
dns2tcp	94.2%	5-15 minutes	Some legitimate TXT record queries
DNSExfiltrator	99.1%	1-5 minutes	Very distinctive pattern

Threat Hunting with IDS/IPS Data

Passive alerting (waiting for IDS to generate alerts) misses sophisticated threats. Proactive threat hunting uses IDS/IPS data to discover threats.

Threat Hunting Process:

Phase	Objective	Data Sources	Typical Duration	Skills Required
Hypothesis Formation	Develop testable threat theory	Threat intelligence, industry reports, red team findings	30-60 minutes	Threat modeling, attacker knowledge
Data Collection	Gather relevant logs/evidence	Zeek logs, Suricata alerts, HIDS data, netflow	15-30 minutes	Query languages (SQL, Splunk SPL, EQL)
Analysis	Search for hypothesis indicators	Log analysis, statistical analysis, visualization	2-8 hours	Data analysis, statistics
Investigation	Validate findings, expand scope	PCAP analysis, endpoint forensics, memory analysis	4-24 hours	Forensics, malware analysis
Response	Contain, remediate, document	Incident response procedures	Varies	IR procedures, coordination

Example Threat Hunt: Detecting Lateral Movement

Hypothesis: "Attackers who compromise single workstation will perform reconnaissance and lateral movement using administrative tools"

Data Collection (Zeek conn.log + Wazuh alerts):

# Zeek query: Find hosts connecting to many others on admin ports
SELECT orig_h, 
       COUNT(DISTINCT resp_h) as target_count,
       COUNT(*) as connection_count
FROM conn_log
WHERE resp_p IN (135, 139, 445, 3389, 5985, 5986)  # SMB, RDP, WinRM
  AND duration < 5  # Short connections (scanning)
  AND orig_h LIKE '10.200.%'  # Workstation subnet
GROUP BY orig_h
HAVING target_count > 20  # Connected to >20 different hosts
ORDER BY target_count DESC;

Results: 3 hosts identified connecting to 30+, 65, and 127 other hosts respectively.

Analysis:

Host	Targets Contacted	Services	Wazuh Alerts	Assessment
10.200.45.67	127	SMB, RDP	0	SUSPICIOUS: Workstation scanning internal network
10.200.23.12	65	RDP only	4 (failed RDP)	SUSPICIOUS: Attempted RDP to many hosts
10.200.89.34	30	SMB	2 (admin tool execution)	CONFIRMED THREAT: Tool execution + scanning

Investigation (10.200.89.34):

PCAP Analysis: Retrieved full packet capture for host
- Initial compromise: Phishing email → macro execution → Cobalt Strike beacon
- Reconnaissance: Network scanner (SoftPerfect Network Scanner) executed
- Lateral movement: PsExec used to deploy additional beacons
Endpoint Forensics: Isolated host, collected memory image
- Cobalt Strike beacon in memory
- Credentials for 8 users dumped (Mimikatz)
- Remote desktop connections to 5 servers established
Scope Expansion: Investigated 5 servers accessed
- 2 servers compromised (beacons deployed)
- Attacker attempted to access domain controller
- Evidence of data staging on file server

Response: Incident response activated, attacker ejected from network, credentials reset, systems re-imaged.

Detection Timeline:

Initial compromise: Day 0
First lateral movement: Day 2
Threat hunt initiated: Day 8
Threat discovered: Day 8 (same day as hunt)
Total dwell time: 8 days

Without threat hunting, attack may have remained undetected significantly longer (average: 197 days per industry statistics).

Operational Considerations and Ongoing Management

IDS/IPS systems require continuous operational investment beyond initial deployment.

Operational Staffing and Cost Model

Role	Responsibilities	Required Skills	Typical Salary	FTE Required	Annual Cost
SOC Analyst (Tier 1)	Alert triage, initial investigation, escalation	Network fundamentals, alert analysis	$65K - $85K	4 (24/7 coverage)	$260K - $340K
SOC Analyst (Tier 2)	Deep investigation, threat hunting, response	Forensics, malware analysis, scripting	$85K - $115K	2	$170K - $230K
Detection Engineer	Rule development, tuning, integration	Networking, scripting, security research	$110K - $145K	1	$110K - $145K
IDS/IPS Administrator	System maintenance, upgrades, performance	Linux admin, network engineering	$95K - $125K	0.5 (shared role)	$48K - $63K
Security Architect	Architecture design, strategy, vendor management	Enterprise architecture, security strategy	$145K - $195K	0.25 (oversight)	$36K - $49K
Total Staffing	Complete IDS/IPS Operations	- -	- -	7.75 FTE	$624K - $827K

Additional Ongoing Costs:

Cost Category	Annual Amount	Description
Ruleset Subscriptions	$15K - $45K	ETPRO, commercial threat intelligence
Hardware Maintenance	$25K - $85K	Support contracts, spare parts
SIEM Licensing	$35K - $185K	Log ingestion, storage
Training	$18K - $45K	Conferences, certifications, vendor training
External Assessments	$25K - $65K	Penetration testing, red team exercises
Infrastructure	$45K - $125K	Power, cooling, network connectivity
Total Operational	$787K - $1.377M	Staffing + expenses

These figures demonstrate that operational costs significantly exceed initial capital investment. Over 5-year period:

Capital Investment (Year 0): $1,043,000
Operational Costs (Years 1-5): $3,935,000 - $6,885,000 (5 × $787K-$1.377M)
Total 5-Year Cost: $4,978,000 - $7,928,000

Organizations must budget appropriately for sustained operations, not just initial deployment.

Performance Optimization and Capacity Planning

IDS/IPS systems must scale with network growth and evolving threats.

Performance Metrics and Targets:

Metric	Target	Monitoring Method	Impact of Missing Target
Packet Drop Rate	<0.1% at sustained load, <1% at peak	Sensor statistics, netstat	Blind spots, missed attacks
CPU Utilization	<70% average, <90% peak	System monitoring	Performance degradation, drops
Memory Usage	<80%	System monitoring	Crashes, instability
Alert Latency	<500ms (signature), <5s (behavioral)	Timestamp analysis	Delayed response
SIEM Ingestion Lag	<30 seconds	SIEM monitoring	Delayed correlation
Disk I/O Wait	<5%	iostat	Performance bottleneck
Rule Reload Time	<30 seconds	Operation logs	Delayed threat detection
PCAP Write Rate	Sustain line rate	Disk throughput monitoring	Lost forensic data

Capacity Planning Example:

Current state (Year 1):

Network throughput: 18.2 Gbps average, 22.4 Gbps peak
Sensor capacity: 30 Gbps (4 sensors × 7.5 Gbps each)
Utilization: 60.7% average, 74.7% peak
Headroom: 7.6 Gbps (34%)

Projected growth:

Network traffic growth: 25% annually (typical enterprise)
Year 2: 22.8 Gbps average, 28.0 Gbps peak
Year 3: 28.5 Gbps average, 35.0 Gbps peak (exceeds capacity)

Capacity Expansion Plan:

Year	Traffic (Avg/Peak)	Required Capacity	Actual Capacity	Action	Cost
Year 1	18.2/22.4 Gbps	22.4 Gbps	30 Gbps	None	$0
Year 2	22.8/28.0 Gbps	28.0 Gbps	30 Gbps	Monitor closely	$0
Year 3	28.5/35.0 Gbps	35.0 Gbps	30 Gbps	Add 2 sensors	$70K
Year 4	35.6/43.8 Gbps	43.8 Gbps	45 Gbps	None	$0
Year 5	44.5/54.8 Gbps	54.8 Gbps	45 Gbps	Add 2 sensors	$70K

Total 5-year capacity expansion: $140K

Conclusion: From 183 Days Undetected to 4.7 Minutes Average Detection

The healthcare organization's transformation from catastrophic 47 TB breach to mature security posture demonstrates open source IDS/IPS's value when properly implemented.

Post-Implementation Results (24 months after deployment):

Metric	Pre-Breach (Commercial IDS)	Post-Implementation (Open Source)	Improvement
Detection Time (Average)	183 days	4.7 minutes	99.998% improvement
True Positive Rate	Unknown (alert fatigue)	94.7%	Measurable, validated
False Positive Rate	~85% (estimated)	3.2%	96.2% improvement
Daily Alert Volume	28,000 (ignored)	420 (all reviewed)	98.5% reduction
Breach Attempts Detected	0 in 6 months	3 in 24 months	Actual protection
Breach Attempts Successful	1 (catastrophic)	0	100% prevention
HIPAA Penalties	$3.2M	$0	$3.2M avoided
Patient Notification Costs	$1.8M	$0	$1.8M avoided
Credit Monitoring (2 years)	$4.6M	$0	$4.6M avoided
Reputation Damage	18% patient loss	0% patient loss	~$15M revenue retained
Total Financial Impact	$24.6M+ loss	$0 losses	$24.6M+ value delivered

Investment Summary:

Category	Year 1	Year 2	2-Year Total
Capital (hardware, deployment)	$434,800	$0	$434,800
Operations (staffing, licensing)	$184,800	$184,800	$369,600
Total Investment	$619,600	$184,800	$804,400
Breach Prevention Value	$24.6M+	~$12M	~$36.6M
Net ROI	3,870%	6,393%	4,451%

These results validated the strategic decision to replace expensive, ineffective commercial IDS with properly implemented open source solution.

Critical Success Factors Identified:

Defense-in-Depth Architecture: Network IDS (Suricata) + behavioral analysis (Zeek) + host IDS (Wazuh) provided comprehensive coverage no single system could achieve
Dedicated Tuning Phase: 4-month aggressive tuning investment reduced false positives 96%, making alerts actionable
SIEM Integration: Centralized correlation (Elasticsearch) enabled pattern detection across multiple data sources
Threat Intelligence Integration: ETPRO rules + custom threat feeds provided timely detection of emerging threats
Continuous Improvement: Weekly rule reviews, monthly threat hunts, quarterly red team exercises maintained effectiveness
Executive Support: Sustained investment through tuning phase required executive understanding of security value

The Fundamental Lesson:

Commercial IDS/IPS systems failed not because they lacked capability, but because the organization lacked operational maturity to effectively deploy them. The vendor's black-box system generated alerts the security team didn't understand, couldn't tune, and eventually ignored.

Open source solutions succeeded because:

Transparency enabled analysts to understand why alerts fired
Customization allowed tuning to organization's specific environment
Community provided detection rules for emerging threats faster than commercial updates
Cost structure freed budget for operational investment rather than licensing

Rachel Chen, the CISO, summarized it perfectly in our post-mortem: "We were trying to buy security instead of building it. The commercial vendor promised their product would protect us. It didn't. Open source tools don't make promises—they give you the raw materials. Then it's on you to build something that works."

Two years later, the healthcare organization's network security program became model for the industry. They published case studies, spoke at conferences, and mentored peer organizations implementing similar architectures. The 47 TB breach—though catastrophic—became catalyst for transformation from security theater to genuine protection.

The lesson extends beyond IDS/IPS: in cybersecurity, transparency, understanding, and operational excellence matter more than vendor promises or expensive products. Open source tools, properly implemented with appropriate operational investment, deliver superior outcomes to black-box commercial alternatives.

As I tell every organization beginning IDS/IPS deployment: budget your time the same way you budget your money. Initial deployment might cost $500K-$1M. Initial tuning will cost 920 hours of analyst time. Ongoing operations will cost 7.75 FTE annually. But the alternative—operating blind on your network, hoping attackers won't notice, discovering breaches 6 months late—costs $24.6M when it inevitably fails.

Ready to transform your network security posture? Visit PentesterWorld for comprehensive implementation guides, tuning playbooks, custom detection rules, threat hunting methodologies, and operational frameworks for deploying production-grade open source IDS/IPS architectures. Our resources help security teams progress from overwhelming alert volumes to mature threat detection capabilities that actually protect their organizations.

Don't wait for your 47 TB breach to force transformation. Build resilient network monitoring today.

Share

Open Source IDS/IPS: Network Monitoring Solutions

When 47 Terabytes Left Unnoticed

The Open Source IDS/IPS Landscape

Financial Impact of Network Intrusion Detection Gaps

Major Open Source IDS/IPS Solutions: Comparative Analysis

Comprehensive Solution Comparison

Suricata: Modern Multi-Threaded Network IDS/IPS

Zeek (formerly Bro): Protocol Analysis and Network Security Monitoring

OSSEC / Wazuh: Host-Based Intrusion Detection

Security Onion: Integrated Network Security Monitoring Platform

Detection Methodology: Signatures, Anomalies, and Behavioral Analysis

Detection Method Comparison

Signature Development and Tuning

Compliance and Regulatory Framework Mapping

Comprehensive Compliance Mapping

ROI of Compliance-Driven IDS/IPS Investment

Implementation Methodology: From Planning to Production

Implementation Phases and Timeline

Network Architecture and Sensor Placement

Tuning Methodology: Reducing False Positives

Advanced Detection Techniques and Threat Hunting

Advanced Detection Patterns

Threat Hunting with IDS/IPS Data

Operational Considerations and Ongoing Management

Operational Staffing and Cost Model

Performance Optimization and Capacity Planning

Conclusion: From 183 Days Undetected to 4.7 Minutes Average Detection

RELATED ARTICLES

COMMENTS (0)

AUTHOR

CONTENTS