03-07-2025, 04:25 AM
Hi,
I am researching what tools exist, to query and deduplicate leak archives.
I found this tool but it seems pretty basic.
https://github.com/JoelGMSec/LeakSearch
In a perfect world a tool would exist which indexes existing leaks, stores them in a file format that is easy to query but also compresses the data (something like Apache Parquet).
Anyone knows of any projects or tooling for something like this?
I am researching what tools exist, to query and deduplicate leak archives.
I found this tool but it seems pretty basic.
https://github.com/JoelGMSec/LeakSearch
In a perfect world a tool would exist which indexes existing leaks, stores them in a file format that is easy to query but also compresses the data (something like Apache Parquet).
Anyone knows of any projects or tooling for something like this?