Posts: 9
Threads: 1
Joined: Apr 2025
I've been noticing that there is a wide variance in db dumps, especially when using different services. Sometimes I see lists in JSON, and sometimes I get ones in CSV or its alternatives or whatever. Is it just me, or is it kinda annoying how many different types are out there?
Posts: 10,305
Threads: 216
Joined: Jun 2023
A ubiquitous format would absolutely help, but some data is better presented, visualized, or stored in very particular formats. We could talk file types, and also encryption, etc.
"Universal appeal is poison masquerading as medicine. Horror is not meant to be universal. It's meant to be personal, private, animal"
Posts: 128
Threads: 4
Joined: Feb 2025
    
(04-09-2025, 01:23 AM)pinksauce Wrote: I've been noticing that there is a wide variance in db dumps, especially when using different services. Sometimes I see lists in JSON, and sometimes I get ones in CSV or its alternatives or whatever. Is it just me, or is it kinda annoying how many different types are out there?
You can always just convert the file types with online services. Usually databases are SQL, or if the backend is fucked, maybe exported as JSON. Scrapes are usually CSV or TXT etc., but again, they can be converted.
Thank you for ranks @ Al-Sheikh and @ 5150 !
Posts: 9
Threads: 1
Joined: Apr 2025
(04-10-2025, 11:06 AM)DredgenSun Wrote: A ubiquitous format would absolutely help, but some data is better presented, visualized, or stored in very particular formats. We could talk file types, and also encryption, etc.
That's a good point tbh. I never thought of that.
Posts: 69
Threads: 2
Joined: Jul 2024
I understand the frustration of having to deal with different formats. As previously mentioned, it just depends on the type of data, as some is easier to read/parse in JSON than in CSV. You just need to collect/create different tools and eventually you have a solution to each problem. For simple datasets I would stick to CSV; for complex data I go with JSON.
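To make the CSV-vs-JSON tradeoff concrete, here is a minimal sketch using only the Python standard library. It flattens nested JSON records into CSV columns. The helper names (`flatten`, `json_to_csv`), the dotted-key convention, and the toy book records are all assumptions made up for illustration, not anything from a particular tool.

```python
import csv
import io
import json


def flatten(record, parent_key="", sep="."):
    """Flatten nested dicts into one level, joining keys with `sep`."""
    flat = {}
    for key, value in record.items():
        full_key = f"{parent_key}{sep}{key}" if parent_key else key
        if isinstance(value, dict):
            flat.update(flatten(value, full_key, sep))
        elif isinstance(value, list):
            # Lists have no natural CSV column, so keep them as a JSON string.
            flat[full_key] = json.dumps(value)
        else:
            flat[full_key] = value
    return flat


def json_to_csv(json_text):
    """Convert a JSON array of objects into CSV text."""
    records = [flatten(r) for r in json.loads(json_text)]
    # Union of all keys, in first-seen order, so no field is dropped.
    fieldnames = list(dict.fromkeys(k for r in records for k in r))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, restval="",
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return out.getvalue()


# Hypothetical toy data: two records with different nested fields.
sample = (
    '[{"title": "Dune", "year": 1965, "author": {"name": "Frank Herbert"},'
    ' "tags": ["sci-fi"]},'
    ' {"title": "Emma", "author": {"name": "Jane Austen", "born": 1775}}]'
)
csv_text = json_to_csv(sample)
print(csv_text)
```

Notice the empty cells and the list squeezed into a string: that is the kind of awkwardness that makes JSON the more comfortable choice once the data is nested.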
Posts: 11
Threads: 1
Joined: Apr 2025
04-12-2025, 11:06 AM
(This post was last modified: 04-12-2025, 11:08 AM by User0193847.)
(04-09-2025, 01:23 AM)pinksauce Wrote: I've been noticing that there is a wide variance in db dumps, especially when using different services. Sometimes I see lists in JSON, and sometimes I get ones in CSV or its alternatives or whatever. Is it just me, or is it kinda annoying how many different types are out there?
I get where you're coming from, but I recommend taking the time to learn a bit of Python and the pandas lib. It's extremely powerful and quick when it comes to processing large data, and it lets you create a DataFrame (a table-like structure) out of the different formats it supports, like .csv, .xlsx, .db and .txt (pretty sure there are more, but I forgot).
I think the format difference between dumps can't really be solved unless the person sharing the dump takes the time to filter, label, and sort the data and then save it in a particular file format. That would be ideal, but the complexity of filtering the data also depends on the nature of the data itself. Take JSON, for example: its structure is very different from CSV. JSON files are basically JavaScript objects in plain text, and converting them can be pretty time consuming, so people just post the data and leave it to others to morph it however they want.
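For anyone who hasn't used pandas, a rough sketch of the idea: load a CSV source and a nested JSON source into DataFrames and combine them into one table. The product data, column names, and the `pricing.price` nesting are hypothetical placeholders; `pd.read_csv`, `pd.json_normalize`, and `pd.concat` are real pandas functions, but treat the overall flow as one possible approach rather than the only way to do it.

```python
import io

import pandas as pd

# Hypothetical toy inputs: the same kind of record in two formats.
csv_text = "id,name,price\n1,widget,2.50\n2,gadget,4.00\n"
json_records = [
    {"id": 3, "name": "gizmo", "pricing": {"price": 9.99}},
    {"id": 4, "name": "doohickey", "pricing": {"price": 1.25}},
]

# CSV maps directly onto a table.
csv_df = pd.read_csv(io.StringIO(csv_text))

# json_normalize flattens nested objects into dotted column names,
# which are then renamed to match the CSV schema.
json_df = pd.json_normalize(json_records).rename(
    columns={"pricing.price": "price"}
)

# Stack both sources into one DataFrame with a fresh index.
combined = pd.concat([csv_df, json_df], ignore_index=True)

# Export the unified table in either format.
combined_csv = combined.to_csv(index=False)
combined_json = combined.to_json(orient="records")
print(combined)
```

The renaming step is where the manual work hides: every source names and nests its fields differently, so the mapping has to be written per source.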
Posts: 9
Threads: 1
Joined: Apr 2025
04-12-2025, 02:17 PM
(This post was last modified: 04-12-2025, 02:20 PM by pinksauce.)
Yeah I get your point. I mention this mainly because I actually did use a method in python to get my riseup account. I just see all these db types and ngl I think that if I put a bit of time into this I could make a decent project that would allow for even more info to be used. Like getting fingerprints, tracking 1 persona via multiple emails etc etc.
Not sure if that idea is a good one yet since we already have a good amount of db look up services but I still kinda wanna try if it's worth it in the long run.
(04-12-2025, 04:02 AM)argue Wrote: I understand the frustration of having to deal with different formats. As previously mentioned, it just depends on the type of data, as some is easier to read/parse in JSON than in CSV. You just need to collect/create different tools and eventually you have a solution to each problem. For simple datasets I would stick to CSV; for complex data I go with JSON.
Yeah, the multiple-tools thing is kinda easy to solve in my case, since I can write scripts that parse and recompile them quickly. I may try to use CSV and JSON as exports for my project.
Posts: 14
Threads: 1
Joined: Sep 2023
JSON is the best format. Sadly, its license does not allow using it for evil.
Posts: 10,305
Threads: 216
Joined: Jun 2023
(04-12-2025, 12:42 AM)pinksauce Wrote: That's a good point tbh. I never thought of that.
The shit thing about file types is the extensions that only open in Word or Excel, or some particular software you have to buy, which I'm utterly against lol
Posts: 9
Threads: 1
Joined: Apr 2025
(04-14-2025, 10:00 AM)DredgenSun Wrote: The shit thing about file types is the extensions that only open in Word or Excel, or some particular software you have to buy, which I'm utterly against lol
Yeah. Excel really is up its own ass with its formats and shit. LibreOffice is my go-to, and when I get their docs, people just get so confused when I can't do the same stuff they can do in their bloatware.