INVESTIGADORES
REINHEIMER Renata
congresos y reuniones científicas
Título:
Weighting schemes for molecular data
Autor/es:
LONE AAGESEN; REINHEIMER RENATA
Reunión:
Congreso; Hennig XXX; 2011
Resumen:
The outcome of a phylogenetic analysis based on DNA
sequence data is highly dependent on the homology-assignment step and may vary
with alignment parameter costs. Here we explore the two alignment parameters,
gap cost versus base change costs, in analyses of DNA sequences of unequal
lengths.
We apply different gap and base change costs to the
TCP-gene family data set in which the homology among sequences are unknown. Reliable
homology assessments among the different duplicates of the TCP sequences are
required before comparative studies aimed to analyze TCP genes function can be perform.
The TCP genes are
a plant specific family of transcription factors with a highly conservative
59-amino acid motive that initiates gene transcription by binding to promoter
regions of the DNA. During the evolution of land plants the TCP family has
developed through duplication and diversification. Whole genome sequencing in
plant species has lead to the discovery of a high number of TCP genes that counts
five members in the Lycophyta but more than 20 members in Eudicots such as Oryza (rice) and Vitis (grapevine).
Current knowledge
divides the TCP gene family into two clades of which members of class II genes are
known to control development of lateral branching and floral symmetry. The TCP
class I genes are less known, and homology assessments among the members of class
I is lacking. Class I TCP sequences are much more variable than Class II
sequences, in the region flanking the conserved motive. This variability has so
far hampered alignment of the sequences and consequently phylogenetic analysis of
TCP class I genes are lacking.
We analyze the TCP
class I genes within a dynamic holomology framework. Homology assessments are
complicated both by homoplastic changes in the sequences as well as sequence
length differences of up to1248 bp. We analyze both the amino-acid sequences as
well as the nucleotide sequences, searching for clades that are robust to
changes in alignment parameter costs. Clades that survive a wide array of cost changes
are considered best candidates for future comparative studies of TCP gene
function.