pandas.util.hash_pandas_object

pandas.util.hash_pandas_object(obj, index=True, encoding='utf8', hash_key='0123456789123456', categorize=True)

Return a data hash of the Index/Series/DataFrame.

Parameters
obj : Index, Series, or DataFrame
index : bool, default True

Include the index in the hash (if Series/DataFrame).
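As a rough illustration of the index flag, the sketch below hashes two Series that hold the same values under different index labels. The data and variable names are made up for the example; the expectation is that the row hashes agree only when the index is excluded.

```python
import numpy as np
import pandas as pd
from pandas.util import hash_pandas_object

# Two Series with the same values but different index labels (illustrative data).
s1 = pd.Series([10, 20, 30], index=[0, 1, 2])
s2 = pd.Series([10, 20, 30], index=[7, 8, 9])

# index=False: only the values contribute to each row hash.
values_only_equal = np.array_equal(
    hash_pandas_object(s1, index=False).to_numpy(),
    hash_pandas_object(s2, index=False).to_numpy(),
)

# index=True (the default): the index labels are mixed into each row hash.
with_index_equal = np.array_equal(
    hash_pandas_object(s1).to_numpy(),
    hash_pandas_object(s2).to_numpy(),
)

print(values_only_equal, with_index_equal)
```

Passing index=False is the usual choice when rows should compare equal regardless of how they are labelled.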

encoding : str, default 'utf8'

Encoding applied to the data and to hash_key when they are strings.

hash_key : str, default _default_hash_key ('0123456789123456')

Key used when hashing string values; it is encoded with encoding before use.
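A minimal sketch of how hash_key might be used to get a different set of hashes for the same strings. The alternative key "abcdefghijklmnop" is a hypothetical value chosen for the example; it is deliberately kept at 16 characters, like the default, on the assumption that the key length matters.

```python
import pandas as pd
from pandas.util import hash_pandas_object

# Illustrative string data.
names = pd.Series(["alice", "bob", "carol"])

# Hashes computed with the default key.
default_hashes = hash_pandas_object(names, index=False)

# Hypothetical alternative key, 16 characters long like the default.
custom_hashes = hash_pandas_object(names, index=False, hash_key="abcdefghijklmnop")

# The two keys are expected to produce different hashes for the same strings.
keys_differ = not default_hashes.equals(custom_hashes)
print(keys_differ)
```

Hashes are only comparable between runs or datasets when the same hash_key and encoding are used for both.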

categorize : bool, default True

Whether to first categorize object arrays before hashing. This is more efficient when the array contains duplicate values.
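The sketch below hashes a string Series with many repeated values using both settings of categorize. It assumes that, for plain strings without missing values, the flag changes only how the work is done and not the resulting hashes; that is an assumption of this example, not something stated on this page. The data is made up.

```python
import numpy as np
import pandas as pd
from pandas.util import hash_pandas_object

# A string column with few distinct values and many duplicates (illustrative data).
colors = pd.Series(["red", "green", "red", "blue", "green"] * 1000)

# categorize=True factorizes first, so each distinct string is hashed once.
h_categorized = hash_pandas_object(colors, index=False, categorize=True)

# categorize=False hashes every element directly.
h_direct = hash_pandas_object(colors, index=False, categorize=False)

# Assumption being checked: both paths give the same hashes for this data.
same_hashes = np.array_equal(h_categorized.to_numpy(), h_direct.to_numpy())
print(same_hashes)
```

For columns where nearly every value is unique, factorizing first is extra work with little benefit, so categorize=False may be worth timing in that case.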

Returns
Series of uint64, same length as the object
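A short usage sketch, with a made-up DataFrame, showing the shape of the result for each accepted input type: one uint64 hash per element of an Index or Series, and one per row of a DataFrame.

```python
import pandas as pd
from pandas.util import hash_pandas_object

# Small illustrative frame with one numeric and one string column.
df = pd.DataFrame({"a": [1, 2, 3], "b": ["x", "y", "x"]})

# Hash an Index, a Series, and the whole DataFrame in turn.
for obj in (df.index, df["b"], df):
    hashed = hash_pandas_object(obj)
    print(type(obj).__name__, hashed.dtype, len(hashed) == len(obj))

# For a DataFrame, each entry of the result corresponds to one row.
row_hashes = hash_pandas_object(df)
```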

© 2008–2021, AQR Capital Management, LLC, Lambda Foundry, Inc. and PyData Development Team
Licensed under the 3-clause BSD License.
https://pandas.pydata.org/pandas-docs/version/1.3.4/reference/api/pandas.util.hash_pandas_object.html