Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Probably incorrect unit of measure in the table column "Raw Size" #58

Open
Artanias opened this issue Jan 25, 2025 · 2 comments
Open

Probably incorrect unit of measure in the table column "Raw Size" #58

Artanias opened this issue Jan 25, 2025 · 2 comments

Comments

@Artanias
Copy link

Artanias commented Jan 25, 2025

Hello, thank you very much for your contribution!

I have started to recalculate your results and in the table column "Raw Size" I think that some of the results must have unit of measure GiB instead of GB.

For example for datasets Thunderbird, HDFS and BGL:

>>> import os
>>> bytes = os.path.getsize("bgl/BGL.log") 
>>> bytes
743185031
>>> bytes_to_MiB = bytes / 1024 / 1024
>>> bytes_to_MiB
708.7564764022827
# In the table 708.76MB

>>> bytes = os.path.getsize("thunderbird/Thunderbird.log")
>>> bytes
31788301041
>>> bytes_to_GiB = bytes / 1024 / 1024 / 1024
>>> bytes_to_GiB
29.605162368156016
# In the table 29.60GB

>>> bytes = os.path.getsize("hdfs/HDFS.log")
>>> bytes
1577982906
>>> bytes_to_GiB = bytes / 1024 / 1024 / 1024
>>> bytes_to_GiB
1.4696111027151346
# In the table 1.47GB
@zhujiem
Copy link
Member

zhujiem commented Feb 19, 2025

Hi, Sorry for the late reply. But what is the difference of GB vs GiB?

@Artanias
Copy link
Author

Artanias commented Feb 19, 2025

Hi, one GB is defined as 1000³ (1,000,000,000) bytes and one GiB as 1024³ (1,073,741,824) bytes. That means one GB equals 0.93 GiB. The first defined by International System of Units and the second defined by International Electronical Commission in 1998.

References.

  1. Difference between Gigabyte and Gibibyte (GB vs GiB).
  2. GB vs GiB: What’s the Difference Between Gigabytes and Gibibytes?.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants