This repository accompanies the Genomator paper. It contains the software and the experimental procedures intended to reproduce the results reported in that paper.
A breakdown of these directories is given below:
The core software for the Genomator tool itself. It is installed into an appropriate Python environment and can then be run on any data source in VCF format to generate new synthetic data. READMEs and documentation covering installation and use of the tool are provided therein.
Python scripts and tools to reproduce the results of the paper. The tools need to be installed into an appropriate Python environment. All data used in the experiments come from the 65K dataset used by Yelmen et al., downloaded from https://gitlab.inria.fr/ml_genetics/public/artificial_genomes/-/tree/master/1000G_real_genomes. All preprocessing is performed by /experiment/experiment/source_gen.py, and the preprocessed data are then ingested by the experiment code.
A cardinality SAT solver with a Python wrapper, compatible with Genomator.
An AngularJS project (with Terraform AWS deployment configuration) for a website version of Genomator.
Copyright (c) 2024 Commonwealth Scientific and Industrial Research Organisation (CSIRO) ABN 41 687 119 230.
This work is freely available for non-commercial scientific research, non-commercial education, or non-commercial research projects, under the CSIRO Non-Commercial License (based on BSD 3-Clause Clear). We welcome commercial enquiries and business partnership opportunities: https://bioinformatics.csiro.au/get-in-touch/
Genomator web (https://genomator.csiro.au/) is a lite version of Genomator that is freely accessible to everyone. The source code of the lite version is also freely available via that website, under the Apache License, Version 2.0.
Note that the optional module in /tinicard is based on a GPLv2+ codebase and is therefore licensed under the GNU GPLv2+. The /tinicard module is not required for the main codebase to operate: it is separate SAT-solving software with a Python wrapper that exposes an API to Python scripts. The main codebase only attempts to import and use this solver's API functions if the command-line parameter --solver_name=tinicard is specified; otherwise it defaults to the Minicard SAT solver through the PySAT Python library.
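The opt-in behaviour described above can be pictured with a minimal sketch. This is an illustration under stated assumptions, not the code used in this repository: the function names `parse_solver_name` and `load_solver_backend`, the default value `"minicard"`, and the importable module name `tinicard` are all hypothetical. Only the `--solver_name` parameter and the use of PySAT's `Minicard` class come from the description above. The point it shows is that the GPL-licensed module is imported lazily, so it is never loaded unless explicitly requested.

```python
import argparse
import importlib

# Assumed default value; the text above only says Minicard is the fallback.
DEFAULT_SOLVER = "minicard"


def parse_solver_name(argv=None):
    """Read --solver_name from the command line, ignoring unrelated options."""
    parser = argparse.ArgumentParser(description="Solver selection sketch")
    parser.add_argument("--solver_name", default=DEFAULT_SOLVER)
    args, _unknown = parser.parse_known_args(argv)
    return args.solver_name


def load_solver_backend(solver_name):
    """Import a solver backend only at the moment it is needed."""
    if solver_name == "tinicard":
        # Hypothetical module name for the optional GPLv2+ wrapper.
        return importlib.import_module("tinicard")
    # Any other value falls back to Minicard from the PySAT library.
    return importlib.import_module("pysat.solvers").Minicard


if __name__ == "__main__":
    name = parse_solver_name()
    print(f"Selected solver: {name}")
```

Calling `load_solver_backend(parse_solver_name())` would then import either the optional wrapper or PySAT's Minicard, depending on the flag. Keeping the import inside the function means a missing /tinicard install only matters when that solver is requested.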