sequence-database-cpp
文件大小: unknow
源码售价: 5 个金币 积分规则     积分充值
资源说明:Translating the python sequence database tool into C++
This is the library creation module of a much larger project. 
The basic idea is to generate peptides from artificially digested 
proteins. 

Simply type 'Make' and then 'make run' to run the supplied fasta file, 
test.fasta. Make generates an executable called 'main'. To digest other 
fasta files simply type:  ./main test.fasta

The premise of the program is simple, given a protein:

LLHSLKIHNNTASQKTALMEQYDRYLIVENLYYRGLVSQDINIMQNVFYKELLAHVDTIP

If cleaved at all K's and R's (trypsin), this protein would generate fragments like this:
691.438 LLHSLK
993.499 IHNNTASQK
1107.5  TALMEQYDR
1326.7  YLIVENLYYR
1684.94 LLHSLKIHNNTASQK
1849.94 GLVSQDINIMQNVFYK
2101    IHNNTASQKTALMEQYDR
2434.2  TALMEQYDRYLIVENLYYR
3176.64 YLIVENLYYRGLVSQDINIMQNVFYK

Where the first column correspond to the neutral mass of the 
fragments. Amino acid masses are defined in AminoAcidMasses.h.

It is also possible that trypsin might not digest everything perfectly. 
There are two types of imperfect digestion, missed cleaveage and semi-tryptic. 
A missed cleaveage would correspond to a simple concatenation of two adjacent 
fully tryptic peptides:

LLHSLK + IHNNTASQK = LLHSLKIHNNTASQK

Semi-tryptic means that only one end of the peptide has a tryptic termini, 
all semi tryptic peptides of GLVSQDINIMQNVFYK:
G LVSQDINIMQNVFYK
GL VSQDINIMQNVFYK
GLV SQDINIMQNVFYK
GLVS QDINIMQNVFYK
GLVSQ DINIMQNVFYK
GLVSQD INIMQNVFYK
GLVSQDI NIMQNVFYK
GLVSQDIN IMQNVFYK
GLVSQDINI MQNVFYK
GLVSQDINIM QNVFYK
GLVSQDINIMQ NVFYK
GLVSQDINIMQN VFYK
GLVSQDINIMQNV FYK
GLVSQDINIMQNVF YK
GLVSQDINIMQNVFY K



本源码包内暂不包含可直接显示的源代码文件,请下载源码包。