[treebase]

[intro]

[why]

[search]

[contact]

[submit]

[stats]

[referee]

[API]

 

 

 [ Title Bar ]

 

The Submission Process

The usual sequence of events is as follows: the author submits a paper for review and initiates a TreeBASE submission. At this stage, the submission is classified as "in progress" -- a submission number is issued (e.g. "SN345") so that the author can return to the submission at a later date and a P.I.N. is issued (e.g. "653453") so that a journal editor or referee can inspect the uploaded data.

When the journal editor says that the paper is "accepted" or "accepted with minor revision," the author can return to the submission and change its status from "in progress" to "ready." At that point, TreeBASE staff will verify that the submission is complete, and if so, the data are made public and the study accession number (e.g. "S123") is generated and emailed to the author. Note that the submission number is not the same as the study accession number -- the former is only temporary, the latter is permanent.

In addition, one or more matrix accession numbers are issued, each one specifying a specific data matrix. The author can, if he or she wishes, insert these numbers in the final draft of the manuscript or in the galley proofs. Authors submitting only one data matrix may wish to only report just the study accession number; authors submitting several matrices may wish to report the matrix accession numbers and indicate which one belongs to which dataset.

Sometimes the author would like to have the data withheld until the journal issue has reached the library shelves. In such cases, the author should simply keep the submission status "in progress," and only change it to "ready" once the journal issue is published. However, this does not allow the author to include the final accession number in the paper. Authors who want to publish accession number and have the data withheld should email TreeBASE staff with specific instructions to this effect.

The following requirements must be met for TreeBASE to accept a submission:

  1. The paper must be published in a peer-reviewed journal or book. Data for manuscripts that are "accepted with minor revision," "accepted," or "in press" can be included in TreeBASE. Data for manuscripts that are "submitted" or "in preparation" must not move beyond the initial stage of TreeBASE submission.
  2. The names of all authors listed with the paper must be included.
  3. At least one data matrix must be included.
  4. At least one tree must result from an analysis of one or more data matrices. In principle we do not accept matrices that were not analyzed to produce a tree; nor do we accept a tree for which the matrix used to produce it is missing.
  5. Normally the trees should be limited to those that were published as figures in the manuscript. However, a set of trees that were used to produce a published consensus tree is also acceptable (within reason). If a tree is not published in a figure but its existence is discussed in the text then it is also, in principle, acceptable.
  6. Only matrices and trees listed with an analysis will be included in the submission. A tree may be listed with only one analysis, but a matrix can be listed with more than one analysis.



The Nexus Format

The (unpaid) staff at TreeBASE will be grateful if (for the time being) you submit your data in nexus format, but we will happily accept your data in other formats as well. We plan on including automated data entry pages for PHYLIP and Hennig86 formats in the future, but for now we are only doing automated tree annotation for NEXUS files. We want to work with you to get your data into TreeBASE, regardless of format.

If your data are not in nexus format, please email the data to us, and we will make the conversion for you.

Otherwise, please proceed with the automated submission process (below).



Things you will need:

  1. This web submission system will require that you copy and paste the abstract, and later, the nexus file, into "text areas." Some web browsers do not allow large files to be pasted into a field. For example Microsoft Explorer does not allow much more than 10 Kbytes to be pasted into a text area. In contrast, recent versions of Netscape do not appear to have this size limit. If you are submitting a large dataset, we suggest that you use Netscape. If you are submitting a dataset of more than 500 Kbytes, we recommend that you contact us and warn us in advance.
  2. You will need your name and email address.
  3. The reference to the publication (as best you can -- if the paper is "in press" omit the page numbers for now).
  4. The abstract .
  5. One nexus file for each matrix. Each nexus file must contain a data block and zero or more trees. In the case of an analysis on two or more datasets (e.g. one morphological and one molecular) these can be submitted as one combined nexus file, or (better yet) as two or more separate nexus files. If submitting separate nexus files, and if one tree resulted from the combined analysis, include this tree in either nexus file (but not both, or the tree will be added twice). Please do not include branch lengths in the tree file! (i.e., remove the colon notation and the values which are used to indicate branch lengths or bootstrap values).
  6. Each tree in the nexus files should be named according to the figure name in the publication (e.g. "Fig. 2"). Note whether each tree is a "single" tree (e.g. a single most parsimonious tree) or a "consensus" tree (the consensus of two or more trees). As a rule, we will accept up to 20 trees per submission. If your submission contains more than 20 trees, please contact TreeBASE staff for advice.
  7. Be prepared to provide names for internal nodes (i.e. higher taxonomic names).
  8. Be prepared to provide a summary of your phylogenetic analyses. Each analysis is applied to one or more matrices and results in one or more trees. You will be asked to identify each matrix and each corresponding tree. You will be asked to note the basic algorithm and software used.
  9. Please verify that taxonomic names are correctly written. If names need to be edited, delete the matrix and then resubmit an edited nexus file. Ideally, edit the nexus file in MacClade ahead of time so that names changed in the data matrix precisely match those in the trees.

For taxonomic names:

  • Avoid using abbreviations (e.g. write Homo_sapiens, not H._sapiens).
  • Avoid brackets, parentheses, commas, colons, and semicolons. Use periods instead.
  • Avoid using codes for non-redundant names (e.g. write Drosophila_melanogaster, not DMelan45GRX). However, it is acceptable two write Drosophila_melanogaster_45GRX if the same tree also contains other populations within the same species.
  • Avoid using nonspecific names, such as Hypothetical_Ancestor, or Outgroup. Instead, write something specific, such as Outgroup_to_Magnoliaceae.
  • Verify that taxon names found in trees are also found in all corresponding matrices.
  • Remember that nexus format requires that you use a "_" instead of a white space (i.e.: " "), and any taxon name containing characters other than alphanumerics or period must be surrounded by single quotes, e.g.: 'Zygiella_x-notata'.



Preview of the submission page.

Our submission system does not require that the entire submission be completed in one sitting. In fact, it is designed to allow submitters to return, edit, and add to their submission at any time in the future until it is finally complete. In order to return to an incomplete submission, you need to remember your submission number.

The following is an overview of what the submission page contains and what you need to do in each section.

Section A.

Enter the names of authors and their respective email addresses here. Use of first and middle names instead of first and middle initials will better help identify the author, especially if the last name is common (e.g. "Smith"). Click "Add Author" to accept the new author. Click "Delete Author" to remove the last listed author.

Section B.

Enter the reference here. If the article is "in press," you may not have page numbers available. If so, just enter as much information as you can. Later, send us an email with the complete reference. Avoid the use of strange characters. Click "Update" to accept the modified reference.

Section C.

Enter the abstract here. The program will remove return keys automatically. Click "Update" to accept the abstract.

Section D.

Click the "Add Matrix" button to submit a nexus file.

If possible, please edit your nexus file with MacClade prior to posting it. This will improve the chances of successful parsing.

After clicking "Add Matrix" you will be presented with the following page:

Using copy/paste, enter the nexus file in the window and click the "Upload Data" button. After a short delay, you should see a page that looks like the following:

From the popup list that appears that the top of the page, select the type of data that you are submitting (e.g. morphological, nucleic acid, etc.). Then name your matrix. If your matrix was published, please use the name of the table or appendix as it appears in the publication (e.g. "Table 2" or "Appendix A"). Otherwise, pick a suitable name (e.g. "Not printed" or "12S mtDNA").

This page summarizes the nexus data that you sent in, and therefore it is an opportunity to verify whether the data were parsed correctly. For example, if you submitted data with character labels and states, these should appear in blue text (character label names) and red text (character state names). If you don't see them, then the parser may have failed.

Likewise, under "Data Synopsis" you should see a list of the taxa in the matrix (blue text) followed by the first 15 characters for each taxon (red text). If these data are missing then it's safe to conclude that your nexus file did not get parsed correctly.

So what should you do if your nexus fails to parse properly? Try any of the following:

  • Open your data in MacClade, than save it, and resubmit.
  • Remove comments
  • If you have large amounts of data in the Assumptions or Codons block, try removing them.
  • Have MacClade save your data in an older nexus format

If none of these succeed, email your data to us and we will try to figure out what is wrong.

If your data appear to be parsed accurately, click the "Submit Data" button and you will return to the submission page.

Whenever you upload data in this fashion, the submission page will reflect your additions in Sections E, F, and H.

Section E.

This section lists any matrices that have been uploaded. If you completed Section D, your matrix should be listed here. If you need to add a second (or third matrix, etc.), just click the "Add Matrix" button in Section D and paste in yet another nexus file.

If you need to delete a previously entered matrix, select it from the popup menu and click the "Delete Matrix" button.

Section F.

This section lists any trees that were uploaded within nexus files entered in Section D. Here you are asked to name your trees as they appeared in the publication (e.g. "Fig. 2"), and assign your trees one of two basic categories: "single" for an individual tree resulting from an analysis (such as a most parsimonious tree, or a preferred tree from a selection of most parsimonious trees); "consensus" for a tree that is a consensus of two or more single trees. Finally, you are asked to provide a recognizable title for each tree (e.g. "Rosaceae" "angiosperms" "Primates" "Green Algae" etc.). Click the "Update Tree Data" button to save the changes that you made.

By each tree is a button called "Edit Nodes." Please use this feature to name internal nodes (i.e. higher taxa). You should see something like the following:

In the center of the page, you should notice a graphic representation of the tree. If you move your mouse over the branches of the tree, you should see a message flicker in the message window (see the red arrow in the figure on the left -- it is pointing to the message window). This message gives the sequence number for a particular node, as well as it's name if it has already been named. Nodes that are already named are given a red dot (see the branch that the mouse is pointing to on the left) and they are listed separately below the tree.

To name a node, simply click on the appropriate branch, enter a name in the field, and click the "Enter Node" button. You will return to the page displaying the tree, but now you should see a red dot beside the branch that you had just edited. Additionally, the name should be listed separately below the tree.

Why name nodes? Attaching a name to a branch or node allows users to more easily find your tree and provides better connectivity between trees. For example, supposing users are not familiar with the specific, binomial names for trees in the genus Acer. It would be helpful, therefore, to name the basal branch "Acer" for a tree containing Acer species. That way a search on the name "Acer" will find this tree without having to specify, for example, Acer japonicum.

Continue clicking and naming as many nodes as you wish. When you're done, click the button at the bottom of the page to return to the submission page.

Section G.

In this section, you are asked to enter a basic outline of the analyses used to generate the trees in your study. In TreeBASE, analyses are the basic link between matrices and trees, so you must describe at least one analysis.

Given the large diversity of phylogenetic methods available, TreeBASE cannot possibly store descriptions of all procedures performed by all authors. Weighting schemes, likelihood models, transformation matrices (etc.) go above and beyond the scope of TreeBASE. (However, retaining the notation for these features in the uploaded nexus file(s) will allow users to recover this information following a matrix download).

TreeBASE does, however, seek to store a very basic overview of the data analysis used to generate the phylogenetic results of a study. A study can contain any number of analyses. Each analysis uses an algorithm (e.g. parsimony) with a software package (e.g. PAUP) on one matrix or on a combination of two or more matrices, leading to one or more trees.

Take, for example, a phylogenetic study involving morphological characters and 12S mitochondrial DNA characters. Lets say that the authors presented five trees in their study: tree 1 results from a parsimony analysis on the 12S data alone; tree 2, a maximum likelihood analysis on 12S data alone; tree 3, a parsimony analysis on the morphological data alone; tree 4, a consensus of the five most parsimonious trees from an analysis of 12S and morphological characters together; tree 5 is a "preferred" tree, selected from one of the five most parsimonious trees. For purposes of TreeBASE, the authors might consider entering four analyses: three analyses using parsimony, of which one uses both matrices and results in two trees (4 and 5); and one analysis using maximum likelihood on the molecular data alone (tree 2).

By clicking on the "Add Analysis" button, you will be presented with a page that lets you select which matrices go with which trees. This page lets you describe the relationship in terms of algorithm and software used.

Section H.

In this section, all taxa entered thus far (via nexus uploads) are listed. For each taxon, one or more matrix names follow. These indicate which taxa belong to which matrices

This is your chance to verify that the names you have entered comply with our guidelines (see above under "For taxonomic names:"). It is also your chance to verify that the same taxon entered via two or more nexus files had an identical spelling across all nexus files.

If any taxonomic names fail to comply with our guidelines, of if they appear misspelled (or differently spelled) in one or more matrices, please delete the corresponding matrix (Section E), edit the nexus file, and resubmit (Section D).

Section I.

This is the last section of the submission page. Here, you are asked to either save your submission "in progress" so that you can return to it later, or upload it for submission to TreeBASE.

If you decide to save it so as to return to it later (for example, you decided to change the names of taxa, and get back to the submission process the following day) be sure to note down your submission number. That way you can return to where you left off at a later date.

For security reasons, please use the same computer (i.e. with the same IP address) when you return to continue editing a submission. If you are using a computer which receives a temporary IP number (such as a PPP or slip connection) please finish your entire submission in a single session.

A final word.

This is a test version of TreeBASE; it is still very much under construction. The data submission form is new code and it doubtlessly still contains bugs. If you have difficulties with the form, please let us know.



If you are ready to submit data, click
here.