COGA is a family-based study that has developed interview instruments and generated genetic data on a large number of individuals, the majority from families heavily affected by alcohol dependence. COGA welcomes qualified researchers who want to use these interview instruments and/or data. They can be accessed by collaboration or the data can be requested directly in several ways.
Accessing COGA Data
(1) Apply for data directly from NIAAA.
Available data include:
Genetic linkage data (microsatellite markers) on the densely affected COGA families, including pedigree plots
GWAS data
DNA samples
Phenotypic data on all family members
Electroencephalography (EEG) data
Note that the subject IDs for these datasets are different than the IDs used by COGA investigators. Additionally, the wealth of data collected by COGA across the years can be challenging for investigators to navigate. For that reason, some investigators choose Option (2).
(2) Collaborate directly with COGA investigators.
The advantage of this approach is that COGA collaborators can help navigate the extensive phenotypic and genotypic datasets, and provide access to new data that may not be publicly available. If you are interested in working with COGA on a project, start by identifying a COGA researcher with interests that map on to the project you would like to pursue. After ensuring your proposed project does not overlap with on-going or planned COGA analyses, that individual can help you navigate the collaboration process, which will involve writing an abstract of the proposed work, presenting your work on a COGA call, and inviting other COGA collaborators with relevant expertise to be involved.
(3) Access data through dbGaP
Six sets of GWAS data, along with limited phenotypic data, are available through NCBI (National Center for Biotechnology Information). For all datasets the subject ID is randomized and different than the ID used by COGA investigators. Each of these datasets contains unique individuals, genotyped at different stages in the project, with the exception of 2-127 samples genotyped on at least two different arrays, to assess quality. See Lai et al (2019) for more detail. The links to the datasets can be found here:
CIDR: Collaborative Study on the Genetics of Alcoholism Case Control Study [phs000125.v1.p1]. GWAS data on cases (primarily probands) and controls drawn from the families.
Study of Addiction: Genetics and Environment (SAGE) [phs000092.v1.p1]. GWAS data on a subset of the COGA case-control individuals plus individuals from studies of cocaine and nicotine dependence.
Families with highest density of alcohol dependence and/or extreme event-related oscillation data [phs000763.v1.p1]. GWAS data on 119 extended families of European descent are available here, along with extensive documentation.
Study on the Genetics of Alcoholism (COGA): African American Family GWAS [phs000976.v1.p1]. GWAS data on all available COGA families of African descent are available.
COGA: Smokescreen GWAS [phs001208.v2.p1]. GWAS data on all remaining COGA DNA samples, primarily of other racial background, were genotyped on the Smoke Screen array.
Exome array data [phs001208.v2.p1}. A subsample of COGA families and individuals were genotyped on the Affymetrix Exon Array.
4) Access experimental data from COGA induced pluripotent stem cells via GEO: Two experimental datasets with RNAseq data are available.
GSE226746: Effect of 2 mM and 17 mM 7-day intermittent EtOH exposure on human excitatory neurons, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM7083051
GSE244985: Synergistic effects of intermittent ethanol exposure and increased GIRK2 expression in NGN2 neurons, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE244985
GSE255988 – Microglial bulk RNAseq from hi and low polygenic score lines +/- ethanol, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE255988
GSE271585 – 18 of the iPSC lines from the COGA hi and low polygenic score collection used for quality control, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE271585
GSE196491 – KCNJ6 variants associated with alcohol use disorder (AUD) in human induced neurons, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE196491
GSE203530 – Alcohol reverses the effects of KCNJ6 (GIRK2) noncoding variants on excitability of human glutamatergic neurons, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE203530
GSE56398 – Increased response to nicotine in human dopaminergic neurons derived from iPSC carrying the risk-associated SNP rs16969968, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE56398
GSE131470 – Expression analysis of ethanol treatment on SK-N-BE(2), https://www.ncbi.xyz/geo/query/acc.cgi?acc=GSE131470
GSE189139 – Multi-omics integration analysis identifies novel genes for alcoholism with potential link to neurodegenerative diseases, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE189139
Multiple datasets are currently embargoed and will be released upon manuscript publication
GSE253841 Massively Parallel Reporter Assay for 3′-UTR Variants Associated with Neurological Disorders in brain tissue cell lines
GSE253842 Perturb-seq analysis using genes associated with alcohol consumption and alcohol use disorder
GSE277313 Integrated Single-Cell Multiomic Profiling of Caudate Nucleus Suggests Key Mechanisms in Alcohol Use Disorder.
Access to Biomaterials
Rutgers University stores, maintains, and distributes biomaterials consisting of lymphoblastoid cell lines and DNA from participating subjects.
The table is a list of all iPSC available through the NIAAA/COGA Sharing Repository from Sampled, Inc., our repository contractor.
COGA iPSC Inventory at Sampled
Cell ID | Clone | Group | Study | Sex | Matcode_Source | Clone_Inventory_Code |
410 | A | GABRA2-Major | F | CPL | R015019398 | |
410 | B | GABRA2-Major | F | CPL | R015019399 | |
518 | A | GABRA2-Major | F | LCL | R014455072 | |
518 | B | GABRA2-Major | F | LCL | R014455073 | |
518 | C | GABRA2-Major | F | LCL | R014455074 | |
908 | A | GABRA2-Major | F | CPL | R015019394 | |
908 | B | GABRA2-Major | F | CPL | R015019393 | |
242 | A | GABRA2-Minor | F | LCL | R012287432 | |
311 | A | GABRA2-Minor | F | CPL | R015019395 | |
311 | B | GABRA2-Minor | F | CPL | R015019396 | |
311 | C | GABRA2-Minor | F | CPL | R015019397 | |
329 | A | GABRA2-Minor | F | LCL | R012324618 | |
329 | B | GABRA2-Minor | F | LCL | R012324619 | |
6344 | A | High PRS | F | ERYB | ERYB000580 | |
6714 | A | High PRS | F | ERYB | ERYB000542 | |
7204 | A | High PRS | Li | F | ERYB | ERYB000248 |
7218 | A | High PRS | F | ERYB | ERYB000582 | |
7290 | A | High PRS | M | ERYB | ERYB000583 | |
7292 | A | High PRS | F | ERYB | ERYB000584 | |
7610 | A | High PRS | M | ERYB | ERYB000589 | |
7978 | A | High PRS | M | ERYB | R041067062 | |
8059 | A | High PRS | Li | M | ERYB | ERYB000539 |
8098 | A | High PRS | Li | F | ERYB | ERYB000540 |
8100 | A | High PRS | F | ERYB | ERYB000250 | |
8203 | A | High PRS | Li | M | ERYB | ERYB000251 |
8260 | A | High PRS | Li | F | ERYB | ERYB000252 |
8528 | A | High PRS | Li | M | ERYB | ERYB000253 |
8864 | A | High PRS | Li | M | ERYB | ERYB000255 |
10179 | A | High PRS | Li | M | ERYB | ERYB000534 |
7897 | A | Low PRS | F | ERYB | ERYB000585 | |
8092 | A | Low PRS | Li | F | ERYB | ERYB000249 |
8403 | A | Low PRS | F | ERYB | ERYB000541 | |
8621 | A | Low PRS | M | ERYB | ERYB000611 | |
8838 | A | Low PRS | Li | M | ERYB | ERYB000254 |
9206 | A | Low PRS | Li | F | ERYB | ERYB000256 |
9618 | A | Low PRS | Li | M | ERYB | ERYB000393 |
9895 | A | Low PRS | M | ERYB | ERYB000612 | |
9990 | A | Low PRS | F | ERYB | ERYB000588 | |
10028 | A | Low PRS | Li | M | ERYB | ERYB000533 |
10196 | A | Low PRS | Li | F | ERYB | ERYB000535 |
10290 | A | Low PRS | M | ERYB | ERYB000579 | |
10399 | A | Low PRS | Li | F | ERYB | ERYB000520 |
10608 | A | Low PRS | Li | F | ERYB | ERYB000538 |
10884 | A | Low PRS | Li | M | ERYB | ERYB000247 |
10929 | A | Low PRS | Li | M | ERYB | ERYB000537 |
233 | A | KCNJ6 AF | Popova | F | CPL | R012306756 |
233 | B | KCNJ6 AF | Popova | F | CPL | R012306757 |
233 | C | KCNJ6 AF | Popova | F | CPL | R012306758 |
246 | A | KCNJ6 AF | Popova | M | CPL | R012303680 |
246 | B | KCNJ6 AF | Popova | M | CPL | R012303681 |
351 | A | KCNJ6 AF | Popova | F | CPL | R012765681 |
351 | B | KCNJ6 AF | Popova | F | CPL | R012765682 |
351 | C | KCNJ6 AF | Popova | F | CPL | R012765683 |
376 | A | KCNJ6 AF | Popova | M | CPL | R012303683 |
376 | B | KCNJ6 AF | Popova | M | CPL | R012303684 |
376 | C | KCNJ6 AF | Popova | M | CPL | R012303685 |
171 | A | KCNJ6 UN | F | CPL | R012314254 | |
171 | B | KCNJ6 UN | F | CPL | R012314255 | |
171 | C | KCNJ6 UN | F | CPL | R012314256 | |
384 | A | KCNJ6 UN | Popova | M | ERYB | ERYB000103 |
384 | B | KCNJ6 UN | Popova | M | ERYB | ERYB000104 |
384 | C | KCNJ6 UN | Popova | M | ERYB | ERYB000105 |
407 | A | KCNJ6 UN | M | CPL | R012311931 | |
407 | B | KCNJ6 UN | M | CPL | R012311932 | |
407 | C | KCNJ6 UN | M | CPL | R012311933 | |
420 | A | KCNJ6 UN | Popova | F | ERYB | ERYB000106 |
420 | B | KCNJ6 UN | Popova | F | ERYB | ERYB000107 |
420 | C | KCNJ6 UN | Popova | F | ERYB | ERYB000108 |
451 | A | KCNJ6 UN | Popova | F | ERYB | ERYB000109 |
451 | B | KCNJ6 UN | Popova | F | ERYB | ERYB000110 |
451 | C | KCNJ6 UN | Popova | F | ERYB | ERYB000111 |
472 | A | KCNJ6 UN | Popova | M | CPL | R012306865 |
472 | B | KCNJ6 UN | Popova | M | CPL | R012306866 |
472 | C | KCNJ6 UN | Popova | M | CPL | R012306867 |
474 | A | KCNJ6 UN | F | CPL | R012311935 | |
474 | B | KCNJ6 UN | F | CPL | R012311936 | |
474 | C | KCNJ6 UN | F | CPL | R012311937 |
COGA iPSC lines available through the NIAAA/COGA Sharing Repository. Use the Clone Inventory Code in the request. The Cell ID, Sex, and Group match cell line identifiers in publications denoted in the Study column (Popova: https://doi.org/10.1038/s41380-022-01818-x; Li: https://doi.org/10.1101/2024.02.19.581066). Matcode Source indicates the cell type used for reprogramming: CPL: cryopreserved lymphocytes; ERYB: erythroblasts; LCL: lymphocyte cell line. In most cases, multiple, reprogrammed iPSC were prepared and frozen but generally only the A clone was used in published studies.
Researchers may gain access to biomaterials by completing an application as described:
COGA Instruments
COGA developed the Semi-Structured Assessment for the Genetics of Alcoholism (SSAGA) interview, which we make freely available for researchers. The SSAGA is available in both adult and child forms. Different versions of the SSAGA have been administered across the different phases of data collection. To obtain the most recent version of the SSAGA, along with associated diagnostic algorithms, please contact Dr. Victor Hesselbrock.
Data Entry in Blaise
SSAGA-IV Entry Program – Instructions for downloading Blaise
Installation:
Users must obtain a Blaise license to download the dep.exe file. Once this is done, please email Sue Winkeler at winkeler@wustl.edu to obtain a password.
After downloading the Blaise executable file, dep.exe, unzip it to a folder of your choice. Note: If all files are unzipped to the same directory along with the dep.exe file, the specification pathnames will be simplified.
To Test:
Click on dep.exe in your folder. A window will open up. You will be prompted to enter the name of the data model. Paste or type the pathname for the test section chosen (e.g. tstDR.bdb) in the window.
On the first screen of the entry program, you will be prompted to enter the ID.
You may stop at this time during the data entry process and save the entered data. You may go back and change any previously entered responses. The program will enforce any changes to skip-out patterns associated with the changed responses.
Comments:
Some age checks are missing. This is intentional. These were omitted at the suggestion of the Assessment Committee.
CSSAGA-IV Entry Program
Electronic Data Entry using ASPECT with the SSAGA form heading