The script takes a genscan file as input, and will translate it in gff format. The genscan format is described here: http://genome.crg.es/courses/Bioinformatics2003_genefinding/results/genscan.html /!\ vvv Known problem vvv /!\ You must have submited only DNA sequence, wihtout any header!! Indeed the tool expects only DNA sequences and does not crash/warn if an header is submited along the sequence. e.g If you have an header ">seq" s-e-q are seen as the 3 first nucleotides of the sequence. Then all prediction location are shifted accordingly. (checked only on the online version http://argonaute.mit.edu/GENSCAN.html. I don't know if there is the same pronlem elsewhere.) /!\ ^^^ Known problem ^^^^ /!\
agat_convert_genscan2gff.pl --genscan infile.bed [ -o outfile ] agat_convert_genscan2gff.pl -h
--genscan or -g
Input bed file that will be convert.
The source informs about the tool used to produce the data and is stored in 2nd field of a gff file. Example: Stringtie,Maker,Augustus,etc. [default: data]
The primary_tag corresponf to the data type and is stored in 3rd field of a gff file. Example: gene,mRNA,CDS,etc. [default: gene]
By default we inflate the block fields (blockCount, blockSizes, blockStarts) to create subfeatures of the main feature (primary_tag). Type of subfeature created based on the inflate_type parameter. If you don't want this inflating behaviour you can deactivate it by using the option --inflate_off.
Feature type (3rd column in gff) created when inflate parameter activated [default: exon].
-o , --output , --out , --outfile or --gff
Output GFF file. If no output file is specified, the output will be written to STDOUT.
-h or --help
Display this helpful text.