Count unique values folder

operators.val.count_unique_values_folder(client, foldername_in='/foldername_in', foldername_out='/foldername_out', dtype='float', ignore='nan', worker_instance_type='x2large', manager_instance_type='small', extension_filename_in='.npy', extension_filename_out='.npy', skip_existing_files=False)
Count unique occurrences of values in a matrix.

count_unique_values_folder(client,
foldername_in=’/foldername_in’,
foldername_out=’/foldername_out’,
dtype=’float’,
ignore=’nan’,
worker_instance_type=’x2large’,
manager_instance_type=”small”,
extension_foldername_in=”.npy”,
extension_foldername_out=”.npy”,
skip_existing_files = False )
Parameters:
  • dtype – Data type of the matrix (default: float)

  • ignore – Data value to ignore

  • foldername_in – Input folder folder for the matrix

  • foldername_out – Output folder folder for the unique counts matrix

  • worker_instance_type – cloud instance type of worker nodes

  • manager_instance_type – cloud instance type of manager node

  • extension_foldername_in – File extension of files in folder for foldername_in

  • extension_foldername_out – File extension of files in folder for foldername_out

  • skip_existing_files – skip files that already exist in the output folder