A **data leak** occurs when your model gets access to data it shouldn’t have during training... usually because the information comes **from the future**, **the target**, or **a data transformation that uses future knowledge**.