INVESTIGADORES
CAVAGNARO pablo Federico
artículos
Título:
Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.)
Autor/es:
CAVAGNARO, P. F.; SENALIK, D.; YANG, L.; SIMON, P. W.; HARKINS, T.; KODIRA, C. D.; HUANG, S.; WENG, Y.
Revista:
BMC GENOMICS
Editorial:
BIOMED CENTRAL LTD
Referencias:
Año: 2010 vol. 11 p. 569 - 586
ISSN:
1471-2164
Resumen:
Background
Cucumber, Cucumis sativus L. is an important vegetable crop worldwide. Until recently,
cucumber genetic and genomic resources, especially molecular markers, have been very limited,
impeding progress of cucumber breeding efforts. Microsatellites are short tandemly repeated
DNA sequences, which are frequently favored as genetic markers due to their high level of
polymorphism and codominant inheritance. Data from previously characterized genomes has
shown that these repeats vary in frequency, motif sequence, and genomic location across taxa.
During the last year, the genomes of two cucumber genotypes were sequenced, the Chinese fresh
market type inbred line 9930 and the North American pickling type cultivar Gy14. These
sequences provide a powerful tool for developing markers in large scale. In this study, we
surveyed and characterized the distribution and frequency of perfect microsatellites in 203 Mbp
assembled Gy14 DNA sequence, representing 55% of its nuclear genome, and in EST sequences
of cucumber. Similar analyses were performed in genomic and EST data from seven other plant
species, and the results were compared with those of cucumber.
Results
A total of 112,073 perfect repeats were detected in cucumber genomic sequence, accounting for
0.9% of the genome, with an overall density of 551.9 SSRs/Mbp. While tetranucleotides were
the most frequent microsatellites, dinucleotide repeats, which had more repeat units than any
other SSR type, had the highest cumulative sequence length in genomic sequences. Coding
regions (ESTs) of the cucumber genome had fewer microsatellites compared to its genomic
sequence, with trinucleotides predominating in the former. AAG was the most frequent repeat in
cucumber ESTs. Overall, AT-rich motifs prevailed in both genomic and EST data. Compared to
the other species examined, cucumber genomic sequence had the highest density of SSRs
(although comparable to the density of poplar, grapevine and rice), and was richest in AT dinucleotides. Using an electronic PCR strategy, we investigated the polymorphism between
9930 and Gy14 at 1,006 SSR loci, and found unexpectedly high degree of polymorphism
(48.3%) between the two genotypes. The level of polymorphism seems to be positively
associated with the number of repeat units in the microsatellite. The in silico PCR results were validated empirically using a subset of the markers. In addition, primer sequences for more than 80,000 newly-discovered cucumber microsatellites, and their exact position in the Gy14 genome assembly, were made publicly available.
Conclusions
The cucumber genome is rich in microsatellites; AT and AAG are the most abundant repeat motifs in genomic and EST sequences of cucumber, respectively. Considering all the species investigated, some commonalities were noted, especially within the monocot and dicot groups, although the distribution of motifs and the frequency of certain repeats was characteristic of the species examined. The large number of SSR markers developed from this study should be a significant contribution to the cucurbit research community.