Resources for Researchers

COGA is a family-based study that has developed interview instruments and generated genetic data on a large number of individuals, the majority from families heavily affected by alcohol dependence. COGA welcomes qualified researchers who want to use these interview instruments and/or data. They can be accessed by collaboration or the data can be requested directly in several ways.

Accessing COGA Data

(1) Apply for data directly from NIAAA.

Available data include:

Genetic linkage data (microsatellite markers) on the densely affected COGA families, including pedigree plots

GWAS data

DNA samples

Phenotypic data on all family members

Electroencephalography (EEG) data

Note that the subject IDs for these datasets are different than the IDs used by COGA investigators. Additionally, the wealth of data collected by COGA across the years can be challenging for investigators to navigate. For that reason, some investigators choose Option (2).

(2) Collaborate directly with COGA investigators.

The advantage of this approach is that COGA collaborators can help navigate the extensive phenotypic and genotypic datasets, and provide access to new data that may not be publicly available. If you are interested in working with COGA on a project, start by identifying a COGA researcher with interests that map on to the project you would like to pursue. After ensuring your proposed project does not overlap with on-going or planned COGA analyses, that individual can help you navigate the collaboration process, which will involve writing an abstract of the proposed work, presenting your work on a COGA call, and inviting other COGA collaborators with relevant expertise to be involved.

(3) Access data through dbGaP

Six sets of GWAS data, along with limited phenotypic data, are available through NCBI (National Center for Biotechnology Information). For all datasets the subject ID is randomized and different than the ID used by COGA investigators. Each of these datasets contains unique individuals, genotyped at different stages in the project, with the exception of 2-127 samples genotyped on at least two different arrays, to assess quality. See Lai et al (2019) for more detail. The links to the datasets can be found here:

CIDR: Collaborative Study on the Genetics of Alcoholism Case Control Study [phs000125.v1.p1]. GWAS data on cases (primarily probands) and controls drawn from the families.

Study of Addiction: Genetics and Environment (SAGE) [phs000092.v1.p1]. GWAS data on a subset of the COGA case-control individuals plus individuals from studies of cocaine and nicotine dependence.

Families with highest density of alcohol dependence and/or extreme event-related oscillation data [phs000763.v1.p1]. GWAS data on 119 extended families of European descent are available here, along with extensive documentation.

Study on the Genetics of Alcoholism (COGA): African American Family GWAS [phs000976.v1.p1]. GWAS data on all available COGA families of African descent are available.

COGA: Smokescreen GWAS [phs001208.v2.p1]. GWAS data on all remaining COGA DNA samples, primarily of other racial background, were genotyped on the Smoke Screen array.

Exome array data [phs001208.v2.p1}. A subsample of COGA families and individuals were genotyped on the Affymetrix Exon Array.

4) Access experimental data from COGA induced pluripotent stem cells via GEO: Two experimental datasets with RNAseq data are available.

GSE226746: Effect of 2 mM and 17 mM 7-day intermittent EtOH exposure on human excitatory neurons, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM7083051

GSE244985: Synergistic effects of intermittent ethanol exposure and increased GIRK2 expression in NGN2 neurons, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE244985

GSE255988 – Microglial bulk RNAseq from hi and low polygenic score lines +/- ethanol, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE255988

GSE271585 – 18 of the iPSC lines from the COGA hi and low polygenic score collection used for quality control, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE271585

GSE196491 – KCNJ6 variants associated with alcohol use disorder (AUD) in human induced neurons, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE196491

GSE203530 – Alcohol reverses the effects of KCNJ6 (GIRK2) noncoding variants on excitability of human glutamatergic neurons, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE203530

GSE56398 – Increased response to nicotine in human dopaminergic neurons derived from iPSC carrying the risk-associated SNP rs16969968, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE56398

GSE131470 – Expression analysis of ethanol treatment on SK-N-BE(2), https://www.ncbi.xyz/geo/query/acc.cgi?acc=GSE131470

GSE189139 – Multi-omics integration analysis identifies novel genes for alcoholism with potential link to neurodegenerative diseases, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE189139

Multiple datasets are currently embargoed and will be released upon manuscript publication

GSE253841 Massively Parallel Reporter Assay for 3′-UTR Variants Associated with Neurological Disorders in brain tissue cell lines

GSE253842 Perturb-seq analysis using genes associated with alcohol consumption and alcohol use disorder

GSE277313 Integrated Single-Cell Multiomic Profiling of Caudate Nucleus Suggests Key Mechanisms in Alcohol Use Disorder.

Access to Biomaterials

Rutgers University stores, maintains, and distributes biomaterials consisting of lymphoblastoid cell lines and DNA from participating subjects.

The table is a list of all iPSC available through the NIAAA/COGA Sharing Repository from Sampled, Inc., our repository contractor.

COGA iPSC Inventory at Sampled

Cell ID	Clone	Group	Study	Sex	Matcode_Source	Clone_Inventory_Code
410	A	GABRA2-Major		F	CPL	R015019398
410	B	GABRA2-Major		F	CPL	R015019399
518	A	GABRA2-Major		F	LCL	R014455072
518	B	GABRA2-Major		F	LCL	R014455073
518	C	GABRA2-Major		F	LCL	R014455074
908	A	GABRA2-Major		F	CPL	R015019394
908	B	GABRA2-Major		F	CPL	R015019393
242	A	GABRA2-Minor		F	LCL	R012287432
311	A	GABRA2-Minor		F	CPL	R015019395
311	B	GABRA2-Minor		F	CPL	R015019396
311	C	GABRA2-Minor		F	CPL	R015019397
329	A	GABRA2-Minor		F	LCL	R012324618
329	B	GABRA2-Minor		F	LCL	R012324619
6344	A	High PRS		F	ERYB	ERYB000580
6714	A	High PRS		F	ERYB	ERYB000542
7204	A	High PRS	Li	F	ERYB	ERYB000248
7218	A	High PRS		F	ERYB	ERYB000582
7290	A	High PRS		M	ERYB	ERYB000583
7292	A	High PRS		F	ERYB	ERYB000584
7610	A	High PRS		M	ERYB	ERYB000589
7978	A	High PRS		M	ERYB	R041067062
8059	A	High PRS	Li	M	ERYB	ERYB000539
8098	A	High PRS	Li	F	ERYB	ERYB000540
8100	A	High PRS		F	ERYB	ERYB000250
8203	A	High PRS	Li	M	ERYB	ERYB000251
8260	A	High PRS	Li	F	ERYB	ERYB000252
8528	A	High PRS	Li	M	ERYB	ERYB000253
8864	A	High PRS	Li	M	ERYB	ERYB000255
10179	A	High PRS	Li	M	ERYB	ERYB000534
7897	A	Low PRS		F	ERYB	ERYB000585
8092	A	Low PRS	Li	F	ERYB	ERYB000249
8403	A	Low PRS		F	ERYB	ERYB000541
8621	A	Low PRS		M	ERYB	ERYB000611
8838	A	Low PRS	Li	M	ERYB	ERYB000254
9206	A	Low PRS	Li	F	ERYB	ERYB000256
9618	A	Low PRS	Li	M	ERYB	ERYB000393
9895	A	Low PRS		M	ERYB	ERYB000612
9990	A	Low PRS		F	ERYB	ERYB000588
10028	A	Low PRS	Li	M	ERYB	ERYB000533
10196	A	Low PRS	Li	F	ERYB	ERYB000535
10290	A	Low PRS		M	ERYB	ERYB000579
10399	A	Low PRS	Li	F	ERYB	ERYB000520
10608	A	Low PRS	Li	F	ERYB	ERYB000538
10884	A	Low PRS	Li	M	ERYB	ERYB000247
10929	A	Low PRS	Li	M	ERYB	ERYB000537
233	A	KCNJ6 AF	Popova	F	CPL	R012306756
233	B	KCNJ6 AF	Popova	F	CPL	R012306757
233	C	KCNJ6 AF	Popova	F	CPL	R012306758
246	A	KCNJ6 AF	Popova	M	CPL	R012303680
246	B	KCNJ6 AF	Popova	M	CPL	R012303681
351	A	KCNJ6 AF	Popova	F	CPL	R012765681
351	B	KCNJ6 AF	Popova	F	CPL	R012765682
351	C	KCNJ6 AF	Popova	F	CPL	R012765683
376	A	KCNJ6 AF	Popova	M	CPL	R012303683
376	B	KCNJ6 AF	Popova	M	CPL	R012303684
376	C	KCNJ6 AF	Popova	M	CPL	R012303685
171	A	KCNJ6 UN		F	CPL	R012314254
171	B	KCNJ6 UN		F	CPL	R012314255
171	C	KCNJ6 UN		F	CPL	R012314256
384	A	KCNJ6 UN	Popova	M	ERYB	ERYB000103
384	B	KCNJ6 UN	Popova	M	ERYB	ERYB000104
384	C	KCNJ6 UN	Popova	M	ERYB	ERYB000105
407	A	KCNJ6 UN		M	CPL	R012311931
407	B	KCNJ6 UN		M	CPL	R012311932
407	C	KCNJ6 UN		M	CPL	R012311933
420	A	KCNJ6 UN	Popova	F	ERYB	ERYB000106
420	B	KCNJ6 UN	Popova	F	ERYB	ERYB000107
420	C	KCNJ6 UN	Popova	F	ERYB	ERYB000108
451	A	KCNJ6 UN	Popova	F	ERYB	ERYB000109
451	B	KCNJ6 UN	Popova	F	ERYB	ERYB000110
451	C	KCNJ6 UN	Popova	F	ERYB	ERYB000111
472	A	KCNJ6 UN	Popova	M	CPL	R012306865
472	B	KCNJ6 UN	Popova	M	CPL	R012306866
472	C	KCNJ6 UN	Popova	M	CPL	R012306867
474	A	KCNJ6 UN		F	CPL	R012311935
474	B	KCNJ6 UN		F	CPL	R012311936
474	C	KCNJ6 UN		F	CPL	R012311937

COGA iPSC lines available through the NIAAA/COGA Sharing Repository. Use the Clone Inventory Code in the request. The Cell ID, Sex, and Group match cell line identifiers in publications denoted in the Study column (Popova: https://doi.org/10.1038/s41380-022-01818-x; Li: https://doi.org/10.1101/2024.02.19.581066). Matcode Source indicates the cell type used for reprogramming: CPL: cryopreserved lymphocytes; ERYB: erythroblasts; LCL: lymphocyte cell line. In most cases, multiple, reprogrammed iPSC were prepared and frozen but generally only the A clone was used in published studies.

Researchers may gain access to biomaterials by completing an application as described:

https://www.niaaa.nih.gov/major-initiatives/collaborative-studies-genetics-alcoholism-coga-study#Access

COGA Instruments

COGA developed the Semi-Structured Assessment for the Genetics of Alcoholism (SSAGA) interview, which we make freely available for researchers. The SSAGA is available in both adult and child forms. Different versions of the SSAGA have been administered across the different phases of data collection. To obtain the most recent version of the SSAGA, along with associated diagnostic algorithms, please contact Dr. Victor Hesselbrock.

Data Entry in Blaise

SSAGA-IV Entry Program – Instructions for downloading Blaise

Installation:

Users must obtain a Blaise license to download the dep.exe file. Once this is done, please email Sue Winkeler at winkeler@wustl.edu to obtain a password.

After downloading the Blaise executable file, dep.exe, unzip it to a folder of your choice. Note: If all files are unzipped to the same directory along with the dep.exe file, the specification pathnames will be simplified.

To Test:

Click on dep.exe in your folder. A window will open up. You will be prompted to enter the name of the data model. Paste or type the pathname for the test section chosen (e.g. tstDR.bdb) in the window.

On the first screen of the entry program, you will be prompted to enter the ID.

You may stop at this time during the data entry process and save the entered data. You may go back and change any previously entered responses. The program will enforce any changes to skip-out patterns associated with the changed responses.

Comments:

Some age checks are missing. This is intentional. These were omitted at the suggestion of the Assessment Committee.

CSSAGA-IV Entry Program

CSSAGA_f1

Electronic Data Entry using ASPECT with the SSAGA form heading

INFOTECHSoft Website

Resources for Researchers

Accessing
COGA Data

Research Instruments

COGA
Publications

Accessing COGA Data

Access to Biomaterials

COGA iPSC Inventory at Sampled

COGA Instruments

Data Entry in Blaise

COGA

AccessingCOGA Data

Research Instruments

COGAPublications

Accessing COGA Data

Access to Biomaterials

COGA iPSC Inventory at Sampled

COGA Instruments

Data Entry in Blaise

COGA

Accessing
COGA Data

COGA
Publications