Abstract

Background: In this article we introduce the release of TIME (Tool for In-place Mo-lecular Editing), a sequence editor devoted to the analysis of large nucleotide and protein sequences such as chromosomes, genomic contigs and their encoded protein products.

Remarks: TIME offers a variety of functions for editing, translating and managing single and multiple sequence files. One of TIME's main features is its ability to process large sequences up to 25 megabases. It includes search capabilities for retrieving open reading frames (ORFs) and their coordinates as well as other user-defined motifs such as restriction, binding, priming sites, etc.

Availability: TIME is distributed by Biotech Vana S.L. at the following [URL 1]. A 30 days free trial version is available.

Introduction

A key element in the rapid advance of molecular biology and omics research has been the development of algorithms and computational methods for processing biological data, accompanied by an exponential growth in computing power value (MIPS per dollar). As sequencing throughput continues to grow owing to the advent of next gen-eration DNA sequencing methods [1-6], computational tools for handling large volumes of biological data become necessary. One of the most common troubles met during genomic sequence analyses is that, owing to the great size of a single chromosome, research is carried out in a fragmented set of sequences [7]. In this paper we present TIME, a software that tackles this problem through efficient memory management when working with se-quences that cannot fit into the main memory.

Overview

Features: TIME is a powerful and versatile tool that allows in-place editing of both nucleotide and amino acid sequences up to 25 × 10⁶, enough for full chromosomes.

TIME software has been programmed to provide easy-to-use features for both basic and advanced analyses. The functions are organized in a hybrid menu-toolbar and have been designed into a common interface that enables easy and logical handling, re-trieval, storage and results display.

TIME accepts both single and multiple sequence files in FASTA format, and allows sequences to be imported straightforwardly from databases such as GenBank [9] or EMBL [10] through copy-pasting into a blank TIME data-sheet. Users have also the option to unlock and edit any sequence either by typing or with the aforementioned cut, copy and paste commands. Sequence geometry and orientation can also be managed. As shown in Figure 1, nucleotide sequences can be translated to all six reading frames, with the start and stop codons highlighted in user-customizable colors. The genetic code for translation can be easily modified and saved into a simple plain text format, so it may be recovered in later sessions.

TIME includes flexible tools for finding ORFs and motifs checking both orientations of the translated sequence. ORFs can be required to have a minimum length and the start and stop codons are specified. As shown in Figure 2, motifs may be searched either in single occurrences or in clusters. In the latter case, parameters such as cluster size, minimum number of motifs within a cluster and/or overlapping clusters can all be speci-fied from the motif editor. ORFs and motifs can be saved and exported to CSV spread-sheet files or as a FASTA file.

Installation

The application is distributed as an installer for Windows XP/Vista/7 (32 bit and 64 bit), a self-extracting disk image for Mac OS X 10.5 or later (64 bit), and a compressed tarball archive for Linux 2.6 kernel series or later (32 bit and 64 bit).

Requirements

TIME requires Java 6 or later. The minimum system requirements for TIME are a PC with a Pentium 4 1.5 GHz or AMD Athlon XP 1500+ processor or higher with at least 1 GB of RAM.

Concluding Remarks

TIME is a powerful biological sequence editing software displayed in a clean and streamlined interface focused on making most operations one click away. It differs from other similar bioinformatic tools such as Gene Runner [URL 2] in its capacity to process sequences up to a few gigabases in size. TIME is distributed both as a standalone tool and as a component of other software we distribute and call GPRO (Futami et al.), the professional tool tailored for the management of large volumes of data in omic analysis. Currently, TIME 1.0 is available as a first fully functional release that will be upgraded with future implementations such as genome browser, restriction mapping, primer design or protein secondary structure prediction tools.

Acknowledgments

The development of TIME has been partly supported by Grant IDI-20100007 from CDTI (Centro de Desarrollo Tecnológico Industrial) and by Torres-Quevedo Grants PTQ-09-01-00020 and PTQ-09-01-00670 from MICINN (Ministerio de Ciencia e Innovación) in Spain.

Funding to pay the Open Access publication charges for this article was provided by the University of Valencia

License and distribution

TIME is commercial software owned and distributed by Biotech Vana S.L at URL1. This software is subject to a License Agreement you should accept during installation and may not be copied, reproduced or otherwise transmitted or recorded, for any purpose, without prior written permission from the owner.