[Genome] is there a fast BED2seq tool?
Hiram Clawson
hiram at soe.ucsc.edu
Wed Nov 22 10:20:10 PST 2006
Good Morning Stein:
It has been pointed out to me that there is a command in
the kent source tree which can reverse complement sequence: faRc
> faRc - Reverse complement a FA file
> usage:
> faRc in.fa out.fa
> In.fa and out.fa may be the same file.
> options:
> -keepName - keep name identical (don't prepend RC)
> -keepCase - works well for ACGTUN in either case. bizarre for other
> letters.
> without it bases are turned to lower, all else to n's
> -justReverse - prepends R unless asked to keep name
> -justComplement - prepends C unless asked to keep name
> (cannot appear together with -justReverse)
On 2006 Nov 22, , at 7:56 AM, Hiram Clawson wrote:
> Good Morning Stein:
>
> Surprisingly enough, I do not see a stand alone tool in the source
> tree to do a reverse complement. There is a function for it of course
> in the libraries, but no command line access to it.
>
> Good idea for such a feature in twoBitToFa.
>
> --Hiram
>
> On 2006 Nov 22, , at 12:12 AM, Stein Aerts wrote:
>
>> Hi Rachel, Hiram,
>>
>> Thanks, this all works very well.
>> I'm left with one detail however: the strand in the BED file is not
>> taken into account when using this:
>>
>> awk '{printf "%s:%d-%d\n", $1, $2, $3}' file.bed \
>> | twoBitToFa hg18.2bit output.fa -seqList=stdin
>>
>>
>> So I presume it returns only + strand sequences. I could reverse
>> complement the - strand features using a bioperl script, but perhaps
>> it's possible with the ucsc tools in one go?
>>
>> Stein
More information about the Genome
mailing list