scphylo.datasets.add_noise#

scphylo.datasets.add_noise(df_in, alpha, beta, missing)[source]#

Add noise to the input genotype matrix.

These noise includes: 1) False positive errors (alpha) 2) False negative errors (beta) 3) Missing entry errors (missing)

Parameters
  • df_in (pandas.DataFrame) – Input genotype matrix.

  • alpha (float) – False positive error rate.

  • beta (float) – False negative error rate.

  • missing (float) – Missing entry error rate.

Returns

A noisy genotype matrix where 0 is absent, 1 is present and 3 is missing.

Return type

pandas.DataFrame