diffalign.utils.datasets

Functions

accumulate_grad_from_subgraph(model, ...[, ...])

get_test_set_with_large_num_conf(base_path, ...)

base_path: directory that contains the GEOM dataset.
dataset_name: dataset name; must be one of [qm9, drugs].
conf_per_mol: keep only molecules that have at least conf_per_mol conformers, and sample the conf_per_mol most probable conformers from each.
train_size: training ratio; val = test = (1 - train_size) / 2.
tot_mol_size: maximum number of molecules.
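The split rule in the description above (val = test = (1 - train_size) / 2) can be illustrated with a small sketch. This is not the library's code: `split_sizes` is a hypothetical helper, and the rounding behaviour (truncating the training count and giving any odd leftover item to the test set) is an assumption made for illustration.

```python
def split_sizes(n_items, train_size=0.8):
    """Return (n_train, n_val, n_test) for a train ratio of `train_size`.

    Hypothetical helper: validation and test share the remainder equally,
    following val = test = (1 - train_size) / 2.
    """
    if not 0.0 < train_size <= 1.0:
        raise ValueError("train_size must be in (0, 1]")
    n_train = int(n_items * train_size)   # truncate toward zero (assumption)
    n_val = (n_items - n_train) // 2      # half of what is left
    n_test = n_items - n_train - n_val    # the rest, so the sizes sum to n_items
    return n_train, n_val, n_test


print(split_sizes(1000, train_size=0.8))
```

With 1000 molecules and train_size = 0.8, this gives 800 training, 100 validation, and 100 test items.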

pdb_to_data(pdb_path, name)

prepare_pdb2(scn_dir, data_path)

prepare_pdb_valtest(scn_dir, data_path)

prepare_pdblarge(scn_dir, data_path)

preprocess_GEOM_dataset(base_path, dataset_name)

preprocess_GEOM_dataset_with_fixed_num_conf(...)

base_path: directory that contains the GEOM dataset.
dataset_name: dataset name; must be one of [qm9, drugs].
conf_per_mol: keep only molecules that have at least conf_per_mol conformers, and sample the conf_per_mol most probable conformers from each.
train_size: training ratio; val = test = (1 - train_size) / 2.
tot_mol_size: maximum number of molecules.
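The conformer filtering described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the function's actual implementation: `select_top_conformers` is a hypothetical name, the input is assumed to be a dict mapping a molecule identifier to a list of conformer dicts, and the `"weight"` key standing in for conformer probability is also an assumption. It reads "most probable" as a top-k selection by weight.

```python
def select_top_conformers(mols, conf_per_mol=5, tot_mol_size=None):
    """Keep molecules with enough conformers and their most probable ones.

    mols: dict mapping a molecule id to a list of conformer dicts, each
          with a hypothetical "weight" entry (higher means more probable).
    """
    selected = {}
    for name, confs in mols.items():
        # Stop once the cap on the number of molecules is reached.
        if tot_mol_size is not None and len(selected) >= tot_mol_size:
            break
        # Drop molecules that have fewer than conf_per_mol conformers.
        if len(confs) < conf_per_mol:
            continue
        # Rank by weight, most probable first, and keep the top conf_per_mol.
        ranked = sorted(confs, key=lambda c: c["weight"], reverse=True)
        selected[name] = ranked[:conf_per_mol]
    return selected


example = {
    "mol_a": [{"weight": 0.1}, {"weight": 0.7}, {"weight": 0.2}],
    "mol_b": [{"weight": 1.0}],  # too few conformers, filtered out
}
kept = select_top_conformers(example, conf_per_mol=2)
```

Fixing the number of conformers per molecule this way yields a dataset where every retained molecule contributes the same number of samples.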

preprocess_iso17_dataset(base_path)

rdmol_cluster_to_data(mol[, smiles])

rdmol_to_data(mol[, smiles, data_cls])

Classes

ConformationDataset(path[, transform])

MolClusterData([x, edge_index, edge_attr, ...])

PackedConformationDataset(path[, transform])

SidechainConformationDataset(path[, ...])