Possible database inconsistencies
This page lists potential inconsistencies in the database, uncovered by automated data-validation routines. These may be mismatch between the format of the data and the FPbase "convention", inconsistencies with external databases (such as GenBank), or inconsistencies between different fields in the database that capture similar information (switch type and transitions, or lineage mutations and sequence). Not all items are necessarily problematic, but this is a good place to begin looking for things to clean up. If you'd rather fill in missing information in the database, start here.
Lineage Problems
Either a mutation string problem, misalignment, or doesn't yield the child sequence.
- GGvT: GdT + M1M does not match the current GGvT sequence (Δ: *0_M1insMVSKGEEVIKEFMRFKVRMEGSMNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPLLMYGSKMYVKHPADIPDYKKLSFPEGFKWVRVMNFEDGGLVTATQDSSLQDGTLIYEVKMRGTNFPPDGPVMQKKTMGWEASTERLYPRDGVLKGEIHQALKLKDGGHYLVEFKTIYMAKKPVQLPGYYYVDTKLDITSHNEDYTRVEQYERSEGRHHLFLYDMDELYKGSTGSGSSGP)
- sfCherry: mCherry + R41H/K97T/R130L/S152T/K167N/N201D does not match the current sfCherry sequence (Δ: V2_G5del/M231_K236del)
- sfCherry2: sfCherry + E118Q/T128I/G220A does not match the current sfCherry2 sequence (Δ: M1del/G225_G226del)
- sfCherry3C: sfCherry2 + K45R/G52D/T106A/K182R/N194D does not match the current sfCherry3C sequence (Δ: *0_E1insM)
- pH-tdGFP: Superfolder GFP + N149Y/Q204H does not match the current pH-tdGFP sequence (Δ: E6_L7insLFTGVVPILVELDGDVNGHKFSVRGEGEGDATNGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKRHDFFKSAMPEGYVQERTISFKDDGTYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNFNSHYVYITADKQKNGIKANFKIRHNVEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTHSVLSKDPNEKRDHMVLLEFVTAAGITHGHGTGSTGSGSSGTASSEDNNMA)
- spGFP 11: Superfolder GFP + L221H/F223Y/T225N does not match the current spGFP 11 sequence (Δ: M1_K214del/H231_K238del)
- esGFP: Superfolder GFP + L42I/F46L/L64C/T65G/S72A/Y106F/L119T/E124K/N149K/L194R/L207I/S208L/K209E/L221H does not match the current esGFP sequence (Δ: T119I)
- Sumire: Superfolder GFP + T65G/Q69A/Y143G/N146I/H148G/F165Y/T203V/S205V/V224R does not match the current Sumire sequence (Δ: M1_S2insV/G143Y/F145G/L236_K238del)
- pHluorin2: pHluorin, ratiometric + F64L does not match the current pHluorin2 sequence (Δ: R80Q/T161I/A163V/G175S/L220F)
- TagRFP658: TagRFP657 + N71D/Q111H/A150T/K203E/K207N does not match the current TagRFP658 sequence (Δ: T6K/T177I/S226T)
- FusionRed-MQV: FusionRed-M + M42Q/C159V does not match the current FusionRed-MQV sequence (Δ: M171L/L175M)
- miRFP2: miRFP + D16S/V66I/P85L/P127S/A155V/Q175R/E211V/S222T/E300V does not match the current miRFP2 sequence (Δ: R22H)
- eYGPdp: YGFPdp + K69E/M205I does not match the current eYGPdp sequence (Δ: C134W)
- td5oxStayGold: tdoxStayGold + E237_G257del/S259H/S261T/A262_G267del/S271_G289del/A294_G322del/S325T/G327_G350delinsS/A352E/G353D/G354N/K355N/L356_M357insD/L356I does not match the current td5oxStayGold sequence (Δ: L258I/I259D)
- td5StayGold: tdStayGold + E237_G257del/S259H/S261T/A262_G267del/S271_G289del/A294_G322del/S325T/G327_G350delinsS/A352E/G353D/G354N/K355N/L356_M357insD/L356I does not match the current td5StayGold sequence (Δ: L258I/I259D)
Sequences that do not match GenBank
These sequences do not exactly match the sequence for the corresponding genbank ID. Sometimes this is just because the genbank sequence has slight N/C terminal differences, or perhaps a His tag. Sometimes it indicates a problem with our sequence, and sometimes Genbank is wrong! The gray text shows the mutation from the FPbase sequence to the Genbank sequence. N- and C- terminal mismatches are less of a concern than internal single point mutations. If you uncover an inconsistency in the literature (e.g. between the paper and the GenBank sequence) please let us know!
Sequences that do not start with methionine
Everything should start with Met. Cross check sequences with original publication and clean up.
Sequences containing His tag
Usually, these sequences were pulled from PDB. Cross-check with originally publication, and remove His tag and any C- N- terminal linkers
Proteins whose states/transitions are inconsistent with their switch type
This may indicate a mis-categorized switch type, or it may indicated that the states and transitions of the protein are not yet completed or accurate. See the documentation for a review on how FPbase categorizes switching types.
- Blue102: Is Timer, looks like Basic.
- Rtms5: Is , looks like Basic.
- aacuCP: Is , looks like Basic.
- ahyaCP: Is , looks like Basic.
- amilCP: Is , looks like Basic.
- amilCP580: Is , looks like Basic.
- amilCP586: Is , looks like Basic.
- amilCP604: Is , looks like Basic.
- apulCP584: Is , looks like Basic.
- cgigCP: Is , looks like Basic.
- cpasCP: Is , looks like Basic.
- gdjiCP: Is , looks like Basic.
- gfasCP: Is , looks like Basic.
- gtenCP: Is , looks like Basic.
- hcriCP: Is , looks like Basic.
- iLov: Is Multistate, looks like Photoactivatable.
- mKeima: Is Basic, looks like Multistate.
- meffCP: Is , looks like Basic.
- spisCP: Is , looks like Basic.
- stylCP: Is , looks like Basic.
- 10B: Is , looks like Basic.
- 5B: Is , looks like Basic.
- Cy11.5: Is Basic, looks like None.
- Xpa: Is Photoactivatable, looks like None.
- AvicFP2: Is Basic, looks like Photoactivatable.
- AvicFP3: Is Basic, looks like Multistate.
- jRGECO1a: Is Multistate, looks like Photoconvertible.
- GCaMP6f: Is Multistate, looks like Photoconvertible.
- GZnP3: Is Basic, looks like Multistate.
- sfCherry: Is Basic, looks like None.
- sfCherry2: Is Basic, looks like None.
- sfCherry3C: Is Basic, looks like None.
- rsGreen1: Is Photoswitchable, looks like Multistate.
- SAASoti: Is Photoconvertible, looks like Multistate.
- Flamindo2: Is , looks like Basic.
- R-GECO1: Is Basic, looks like Multistate.
- R-GECO1.2: Is Basic, looks like Multistate.
- O-GECO1: Is Basic, looks like Multistate.
- CAR-GECO1: Is Basic, looks like Multistate.
- K-GECO1: Is Basic, looks like Multistate.
- jRCaMP1a: Is Basic, looks like Multistate.
- REX-GECO1: Is Basic, looks like Multistate.
- jREX-GECO1: Is Basic, looks like Multistate.
- GRvT: Is Basic, looks like Multistate.
- ShyRFP: Is Multi-photochromic, looks like Photoactivatable.
- mVermilion: Is Basic, looks like None.
- Kohinoor2.0: Is Photoswitchable, looks like Multistate.
- Channelrhodopsin2: Is Photoactivatable, looks like Basic.
- spGFP1-10: Is Basic, looks like None.
- spGFP 11: Is Basic, looks like None.
- Chronos: Is Basic, looks like None.
- Padron2: Is Photoswitchable, looks like Multistate.
- Superfolder BFP: Is Basic, looks like None.
- mRubyFT: Is Timer, looks like Multistate.
- esGFP: Is Basic, looks like None.
- pHRed: Is Basic, looks like Multistate.
- PSLSSmKate: Is Photoswitchable, looks like Multistate.
- mScarlet2-I: Is Basic, looks like None.
- Fast-FT: Is Timer, looks like None.
- Medium-FT: Is Timer, looks like None.
- Slow-FT: Is Timer, looks like None.
- pHmScarlet: Is Basic, looks like Multistate.
- superecliptic pHluorin: Is Multistate, looks like Basic.
- Superfolder pHluorin: Is Multistate, looks like None.
- Lime: Is Multistate, looks like Basic.
- tsPurple: Is Basic, looks like None.
- scOrange: Is Basic, looks like None.
- amilCP-Orange: Is Basic, looks like None.
- amilCP-Pink: Is Basic, looks like None.
- C3PA-GFP: Is Photoactivatable, looks like None.
- frSkylan-S: Is Photoswitchable, looks like Basic.
- Bovine serum albumin: Is Basic, looks like None.
- mGFP5: Is Basic, looks like None.
- MmGFP6: Is Basic, looks like None.
- pFAST + HMBR: Is , looks like Basic.
- pFAST: Is Basic, looks like None.
- FAST: Is , looks like Basic.
- FAST + HMBR: Is , looks like Basic.
- FAST + HBR 3,5 DOM: Is , looks like Basic.
- mGold2s: Is Basic, looks like None.
- mGold2t: Is Basic, looks like None.
Possibly miscategorized as "basic"
Proteins whose names (or publication titles) suggest that they may be switchable/convertable, but have only a single fluorescent state.