Every Parquet file has the footer that contains metadata information: schema, row groups and column statistics. The footer is located at the end of the file.
A parquet file content starts and ends with 4-byte PAR1
“magic” string. Right before the ending PAR1
there is 4-byte footer length size (little-endian encoding):
The position of the footer can be easily calculated as: File_length - Footer_length - 4