Difference between revisions of "R Hackathon 1/Trait Evolution SG"

From Phyloinformatics
Jump to: navigation, search
Line 3: Line 3:
 
# Compare various implementations of the same methods (ape, geiger, OUCH, Mesquite)
 
# Compare various implementations of the same methods (ape, geiger, OUCH, Mesquite)
 
# Improve functionality of character fitting in r
 
# Improve functionality of character fitting in r
# Investigate statistical power of discriminating among various models
 
 
# Identify gaps in current implementation
 
# Identify gaps in current implementation
 
== Accomplishments. ==
 
== Accomplishments. ==
Line 19: Line 18:
 
#** ape is reporting the joint likelihoods for ancestral states. This uses the single set of ancestral states that together result in the highest likelihood on the whole tree.
 
#** ape is reporting the joint likelihoods for ancestral states. This uses the single set of ancestral states that together result in the highest likelihood on the whole tree.
 
#** mesquite and geiger use marginal likelihoods for ancestral states. This represents the likelihood averaged over all possible ancestral character state values.
 
#** mesquite and geiger use marginal likelihoods for ancestral states. This represents the likelihood averaged over all possible ancestral character state values.
# Improve functionality or, at least, interpretability of output
+
#** This also means that you get different ancestral state reconstructions from ape and mesquite
#* GEIGER was modified to give more reliable results by a more thorough search of the likelihood surface
+
# Improved functionality
#* Some ape functions seem (to us) unreliable for large trees
+
#* geiger was modified to give more reliable results by a more thorough search of the likelihood surface (fitContinuous)
 
+
#* geiger can deal with a more general set of discrete character models (fitDiscrete)
Test
+
#* geiger's tree transformations now work for nonultrametric trees
** Clarified why one sometimes gets different results from different programs
+
# Identify gaps in current implementation
** Modified GEIGER package so that results for fitting models of character evolution are more robust
+
#* The main gap, from an end-user perspective, is obtaining estimates of ancestral character states in r
** Modified functions to rescale trees and variance-covariance matrices for non-ultrametric trees
+
#* ape does this, but only for joint likelihoods, and the function sometimes has trouble finding the ML solution
 +
==To do==
 +
# Implement "white noise" model in geiger's fitContinuous
 +
# Investigate statistical properties of these methods
 +
#* Which models can we tell apart?
 +
#* How much data is needed?
 +
#* Are parameter estimates biased?
  
  
 
[[Category:R Hackathon 1]]
 
[[Category:R Hackathon 1]]

Revision as of 16:36, 24 January 2008

  • Participants: Harmon, Hipp, Hunt

Targets.

  1. Compare various implementations of the same methods (ape, geiger, OUCH, Mesquite)
  2. Improve functionality of character fitting in r
  3. Identify gaps in current implementation

Accomplishments.

  1. Evaluated the results of continuous character analyses in different packages
    • Packages are mostly consistent
    • Discrepancies come from two sources:
      • Different approaches (e.g. marginal versus joint likelihood)
      • Difficulties in finding the ML solution
    • For continuous characters:
      • geiger and OUCH tend to return the same parameter estimates
      • But they return different likelihoods
    • For discrete characters
      • geiger and mesquite are consistent, returning the same parameter estimates and likelihoods
      • geiger and ape are different
      • ape is reporting the joint likelihoods for ancestral states. This uses the single set of ancestral states that together result in the highest likelihood on the whole tree.
      • mesquite and geiger use marginal likelihoods for ancestral states. This represents the likelihood averaged over all possible ancestral character state values.
      • This also means that you get different ancestral state reconstructions from ape and mesquite
  2. Improved functionality
    • geiger was modified to give more reliable results by a more thorough search of the likelihood surface (fitContinuous)
    • geiger can deal with a more general set of discrete character models (fitDiscrete)
    • geiger's tree transformations now work for nonultrametric trees
  3. Identify gaps in current implementation
    • The main gap, from an end-user perspective, is obtaining estimates of ancestral character states in r
    • ape does this, but only for joint likelihoods, and the function sometimes has trouble finding the ML solution

To do

  1. Implement "white noise" model in geiger's fitContinuous
  2. Investigate statistical properties of these methods
    • Which models can we tell apart?
    • How much data is needed?
    • Are parameter estimates biased?