Every Parquet file has the footer that contains metadata information: schema, row groups and column statistics. The footer is located at the end of the file.
A parquet file content starts and ends with 4-byte
PAR1 “magic” string. Right before the ending
PAR1 there is 4-byte footer length size (little-endian encoding):
The position of the footer can be easily calculated as:
File_length - Footer_length - 4