Experimenting with Automatic PII Detection on the Hub using Presidio
outlines library with transformers to define a JSON schema that the generation has to follow. It uses a Finite State Machine with token_id values as transitions.
Visualize HDF5 data of the dataset
JupyterLab Notebooks With PySpark Optimized for HF Datasets