Open Access archive

A data set of variants derived from 1455 clinical and research exomes is efficient in variant prioritization for early-onset monogenic disorders in Indians

Neethukrishna Kausthubham, Kasturba Medical College, Manipal
Anju Shukla, Kasturba Medical College, Manipal
Neerja Gupta, All India Institute of Medical Sciences, New Delhi
Gandham S. Bhavani, Kasturba Medical College, Manipal

Document Type

Article

Publication Title

Human Mutation

Abstract

Given the genomic uniqueness, a local data set is most desired for Indians, who are underrepresented in existing public databases. We hypothesize patients with rare monogenic disorders and their family members can provide a reliable source of common variants in the population. Exome sequencing (ES) data from families with rare Mendelian disorders was aggregated from five centers in India. The dataset was refined by excluding related individuals and removing the disease-causing variants (refined cohort). The efficiency of these data sets was assessed in a new set of 50 exomes against gnomAD and GenomeAsia. Our original cohort comprised 1455 individuals from 1203 families. The refined cohort had 836 unrelated individuals that retained 1,251,064 variants with 181,125 population-specific and 489,618 common variants. The allele frequencies from our cohort helped to define 97,609 rare variants in gnomAD and 44,520 rare variants in GenomeAsia as common variants in our population. Our variant dataset provided an additional 1.7% and 0.1% efficiency for prioritizing heterozygous and homozygous variants respectively for rare monogenic disorders. We observed additional 19 genes/human knockouts. We list carrier frequency for 142 recessive disorders. This is a large and useful resource of exonic variants for Indians. Despite limitations, datasets from patients are efficient tools for variant prioritization in a resource-limited setting.

First Page

e15

Last Page

e61

DOI

10.1002/humu.24172

Publication Date

4-1-2021

Recommended Citation

Kausthubham, Neethukrishna; Shukla, Anju; Gupta, Neerja; and Bhavani, Gandham S., "A data set of variants derived from 1455 clinical and research exomes is efficient in variant prioritization for early-onset monogenic disorders in Indians" (2021). Open Access archive. 2953.
https://impressions.manipal.edu/open-access-archive/2953

This document is currently not available here.

COinS

Open Access archive

A data set of variants derived from 1455 clinical and research exomes is efficient in variant prioritization for early-onset monogenic disorders in Indians

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Search

Browse

Author Corner

Open Access archive

A data set of variants derived from 1455 clinical and research exomes is efficient in variant prioritization for early-onset monogenic disorders in Indians

Authors

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Share

Search

Browse

Author Corner