Can anyone suggest methods for recovering the complete sequence of a protein that is reported only as a low-quality predicted sequence in the RefSeq or NCBI Protein database?
1) I have whole-genome data at more than 50X coverage. When I do a BLAST search (with the human ortholog as the query) against the SRA data, I get many hits, because my gene of interest has 4 other similar protein sequences with approx. 40% sequence identity.
2) The available assembly has missing residues in the exon regions.
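To make the problem in point 1 concrete, here is a minimal sketch of the kind of filtering I have in mind, assuming the BLAST hits are saved in tabular format (-outfmt 6, where column 2 is the subject ID, column 3 the percent identity and column 4 the alignment length). The function name, the thresholds and the example rows are all made up for illustration; the right identity cutoff for separating my gene from its ~40% identical relatives is exactly what I am unsure about.

```python
# Hypothetical helper: keep only the reads whose alignment to the query is
# well above the ~40% identity expected for the similar (paralogous) genes.
# The thresholds are assumptions and would need tuning on real data.
def filter_hits(blast_tabular, min_identity=80.0, min_length=30):
    kept = []
    for line in blast_tabular.splitlines():
        # Skip blank lines and comment lines (e.g. from -outfmt 7).
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        read_id = fields[1]           # sseqid
        identity = float(fields[2])   # pident
        length = int(fields[3])       # alignment length
        if identity >= min_identity and length >= min_length:
            kept.append(read_id)
    return kept


# Made-up example rows in -outfmt 6 column order:
# qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
example = "\n".join([
    "human_query\tread_1\t95.0\t33\t2\t0\t10\t42\t1\t99\t1e-15\t70.1",
    "human_query\tread_2\t41.2\t34\t20\t0\t50\t83\t1\t102\t1e-03\t30.0",
    "human_query\tread_3\t88.9\t36\t4\t0\t90\t125\t3\t110\t1e-12\t62.4",
])

print(filter_hits(example))
```

The reads kept this way could then be assembled locally to fill the missing exon regions, but I do not know whether a simple identity cutoff is reliable enough here.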
My aim is to find the cDNA sequence so that I can clone it and characterize the protein by experimental methods.
Thank you for your help. Kumar