Supported log formats
Graph Hunter supports multiple log formats. Use Auto-detect to let the engine choose the parser from content heuristics, or select a format manually.
Auto-detect (recommended)
The engine identifies the format as follows:
Content heuristic |
Parser used |
|---|---|
JSON with |
Sysmon |
JSON with Sentinel |
Microsoft Sentinel |
Other JSON |
Generic (field normalization) |
Non-JSON (e.g. CSV) |
CSV (then Generic) |
Sysmon (Windows Event Log)
Parses Sysmon JSON (array or NDJSON). Supported event IDs and triples:
Event ID |
Description |
Triples |
|---|---|---|
1 |
Process Create |
|
3 |
Network Connection |
|
11 |
File Create |
|
22 |
DNS Query |
|
Microsoft Sentinel (Azure)
Parses Sentinel-style JSON (e.g. exported from Log Analytics). Records are classified by the Type field when present.
Table / source |
Triples |
|---|---|
SecurityEvent (4624/4625) |
|
SecurityEvent (4688) |
|
SecurityEvent (4663) |
|
SigninLogs |
|
DeviceProcessEvents |
|
DeviceNetworkEvents |
|
DeviceFileEvents |
|
CommonSecurityLog |
|
Generic JSON
Format-agnostic parser:
Normalization: Maps 80+ field name variants (case-insensitive) to canonical names (e.g.
source_user,SourceUserName→ user).Inference: Builds triples from normalized fields using rules (e.g.
source_user+source_process→User →[Execute]→ Process).Configurable: You can define or adjust field → entity type mapping and preview before ingest.
Example inferred triples:
Condition (fields present) |
Triple |
|---|---|
source_user + source_process |
|
parent_process + source_process |
|
source_host + target_ip |
|
source_ip + target_ip |
|
source_process + target_file |
|
source_process + target_domain |
|
source_process + target_url |
|
source_process + target_registry |
|
Works with any JSON log schema; no per-format config required for basic use.
CSV
Parses CSV files with headers.
Each row is converted to a JSON object and then processed by the Generic parser.
Handles quoted fields and embedded commas.
Choose CSV when the input is CSV; for JSON, use Auto-detect or Generic.
Entity and relation types
Across all parsers, the model uses:
Entity types: IP, Host, User, Process, File, Domain, Registry, URL, Service (plus wildcard * in hypotheses).
Relation types: Auth, Connect, Execute, Read, Write, DNS, Modify, Spawn, Delete (plus wildcard * in hypotheses).