Attribute Description Values
CHROM Chromosome where the mutation occurred Numerical integer values 1 to 23
POS Position on the chromosome where the mutation occurred Numerical interger value
REF Codon before mutation Codon (NNN)
OLD Codon after mutation Codon (NNN)
Interpro_domain The domain information (if any) from the InterPro database Domain description
dbNSFP_DEOGEN2_pred Information predicted by the DEOGEN2 Possible values:
D (Damaging)
T (Tolerated)
dbNSFP_MetaSVM_pred Information predicted by MetaSVM Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_fathmmMKL_coding_pred Information predicted by the fathmm-MKL_coding Possible values:
D (Deleterious)  
N (Neutral)
dbNSFP_PrimateAI_pred Information predicted by PrimateAI Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_PROVEAN_pred Information predicted by PROVEAN Possible values:
D (Deleterious)  
N (Neutral)
dbNSFP_MCAP_pred Information predicted by M-CAP Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_ClinPred_pred Information predicted by ClinPred Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_BayesDel_addAF_pred Information predicted by the BayesDel_addAF Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_ExAC_AF Allelic frequency of ExAC Numerical value ranging from 0 to 1.
dbNSFP_Polyphen2_HVAR_pred Information predicted by Polyphen2_HVAR Possible values:
D (Damaging)     
P (Probably damaging) 
B (Benign)
dbNSFP_SIFT_pred Information predicted by the SIFT Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_FATHMM_pred Information predicted by FATHMM Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_SIFT4G_pred Information predicted by SIFT4G Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_LRT_pred Information predicted by the LRT Possible values:
D (Deleterious)
N (Neutral) 
U (Unknow)
dbNSFP_fathmmXF_coding_pred Information predicted by the fathmm-XF_coding Possible values:
D (Deleterious) 
N (Neutral)
dbNSFP_BayesDel_noAF_pred Information predicted by the BayesDel_noAF 

Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_gnomAD_exomes_AF Allelic frequency of the gnomAD Numerical value ranging from 0 to 1.
dbNSFP_Aloft_pred Information predicted by Aloft Possible values:
Dominant 
Recessive 
Tolerant
dbNSFP_MutationTaster_pred Information predicted by MutationTaster Possible values:
D (Probably Deleterious) 
N (Probably Harmless) 
A (Deleterious) 
P (Harmless)
dbNSFP_MetaLR_pred Information predicted by MetaLR Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_LISTS2_pred Information predicted by LIST-S2 Possible values:
D (Damaging) 
T (Tolerated)
dbNSFP_Polyphen2_HDIV_pred Information predicted by the Polyphen2_HDIV Possible values:
D (Damaging) 
P (Probably damaging) 
B (Benign)
dbNSFP_MutationAssessor_pred Information predicted by MutationAssessor Possible values:
H (High)
M (Medium)
L (Low)
N (Neutral)
am_pathogenicity* Pathogenicity score from AlphaMissense Numerical values ranging from 0 to 1.
am_class* AlphaMissense mutation classification Possible values:
Ambiguous 
Benign
Pathogenic
VariantEffect_EFF Classification of Single Nucleotide Polymorphism Possible values: 
NON_SYNONYMOUS_CODING
NON_SYNONYMOUS_CODING+SPLICE_SITE_REGION
NON_SYNONYMOUS_START
SYNONYMOUS_CODING
SPLICE_SITE_ACCEPTOR+INTRON
START_LOST
STOP_GAINED
STOP_GAINED+SPLICE_SITE_REGION
STOP_LOST, START_LOST
STOP_LOST+SPLICE_SITE_REGION
Risco_Mut_EFF Risk of mutation Possible values:
MODERATE, 
HIGH, 
LOW
Tipo_Mut_EFF Mutation Type Possible values:
MISSENSE
NONSENSE
SILENT
Point_Mutation_EFF Point of mutation at the codon Format: aaG/aaT
Changepr_F A nomenclature that represents the change that occurs in a protein (source: https://archive.hgvsnomenclature.org/). Formats:
p.Pro609Ala 
p.Ser43* 
p.Ter327Argext*? 
p.Met1?
changeProt_am A nomenclature that represents the change that occurs in a protein Format: G765V
changecDNA_EFF A nomenclature that represents the change that has occurred in a codon (source: https://archive.hgvsnomenclature.org/) Format: c.502T>C
Gene_EFF Gene Gene Symbol
RefSeq_EFF Reference Sequence database Accession number Example: NM_001282426.1
Exon_EFF Exon number Numerical value ranging from 1 to n.
Pos_Point_Mutation_EFF Position of the mutation in the codon The Pos_Point_Mutation_EFF attribute will be rendered as follows: 
- Point_Mutation: aaC/aaT, Pos_Point_Mutation will have value 3.
- Point_Mutation: cTg/cGg, Pos_Point_Mutation will have value 2.
- Point_Mutation: Ctg/Atg, Pos_Point_Mutation will have value 1.
poschangecDNA_EFF Position in the transcript where the mutation occurred Numerical value ranging from 1 to n.
typechangecDNA_EFF Identify the type of change. Possible values: 
> (replacement) 
del (deletion) 
DUP (duplication)
aminBefore Amino acid before mutation (reference) Amino acid three letter abbreviations.
aminAfter Amino acid after mutation (alternative) Amino acid three letter abbreviations.
poschangeProt Position on the protein where the mutation occurred Numerical value ranging from 1 to n.
typechangeProt Identify the type of change. Possible values: 
subst (replacement) 
del (deletion) 
fs (frameshift: A shift on the reading frame between the start and stop codon of the coding sequence). 
Translation termination.
SNP_ID_COMMON Id from the dbSNP database Example: rs1393348728
COMMON It identifies whether or not the mutation is frequent in the population. Possible values: 
0 (not common in the population) 
1 (common in population)
PolyPhen2_Dam_pred Junction of the two PolyPhen2 predictors. The information will be generated based on the following conditions: 
1, if the dbNSFP_PolyPhen2_HDIV_pred or dbNSFP_PolyPhen2_HVAR_pred value are D ou P.
0, if the dbNSFP_PolyPhen2_HDIV_pred or dbNSFP_PolyPhen2_HVAR_pred values are B.
Ndamage Number of predictors that predicted the mutation as damaging in relation to the total number of predictors tested. Format:
n/20, where:
n - Number of predictors predicting the mutation as damaging
20 - Total number of predictors tested.
NdamageCalc Number of predictors that predicted the mutation as damaging. Numerical interger values ranging from 0 to 20.
Deleterious Calculated as follows: 
- Expression1: The SIFT attribute, or the PolyPhen2_Dan attribute or the PROVEAN attribute must have the value D or P.
- Expression2: The attribute dbNSFP_gnomAD_exomes_AF must be less than 10-4 .

- Expression3: The NdamageCalc attribute should be >3 

- Expression4: The COMMON attribute should be 0 

Deleteria = Expressao1 and Expressao2 and Expressao3 and Expressao4
Binary format:
0 = Non-deleterious (False).
1 = Deleterious (True).
Deleterious5 Calculated as follows: 
- Expression1: The SIFT attribute, or the PolyPhen2_Dan attribute or the PROVEAN attribute  should have the value D or P.
- Expression2: The attribute dbNSFP_gnomAD_exomes_AF must be less than 10-4.
Expression3: The NdamageCalc attribute  should be >= 5
- Expression4: The COMMON attribute should be 0. 
Deleteria5 = Expressao1 and Expressao2 and  Expressao3 and Expressao4
Binary format:
0 = Non-Deleterious5 (False).
1 = Deleterious5 (True).
Deleterious10 Calculated as follows: 
- Expression1: The SIFT attribute, or the PolyPhen2_Dan attribute or the PROVEAN attribute must have the value D or P. 

- Expression2: The attribute dbNSFP_gnomAD_exomes_AF must be less than 10-4
- Expression3: The NdamageCalc attribute should be >= 10
- Expression4: The COMMON attribute should be 0. 
Deleteria10 = Expressao1 and Expressao2 and Expressao3 and Expressao4
Binary format:
0 = Non-Deleterious10 (False).
1 = Deleterious10 (True).
Deleterious11 Calculated as follows: 
- Expression1: The SIFT attribute, or the PolyPhen2_Dan attribute or the PROVEAN attribute must have the value D or P. 
- Expression2: The attribute dbNSFP_gnomAD_exomes_AF must be less than 10-4
- Expression3: The NdamageCalc attribute should be >= 11 .
- Expression4: The COMMON attribute should be 0. 
Deleteria10 = Expressao1 and Expressao2 and Expressao3 and Expressao4
Binary format:
0 = Non-Deleterious11 (False).
1 = Deleterious11 (True).
Uniprot_id UniProtKB database identifier Example: Q8WWA0.
PDB_id Protein Data Bank database identifier Example: 1ZEH.
Resolution# Resolution of the three-dimensional structure of the protein (x-ray crystalography). The lower the resolution, the better the quality. Numerical value ranging from 0 to 3.0 Å.
Swiss-Prot# Swiss-Prot database identifier Example: ITLN1_HUMAN.
db_align_beg# Starting Position of Alignment in UNIPROT entry. Numerical interger ranging from 0 to n.
db_align_end# Final position of the alignment in UNIPROT entry. Numerical interger ranging from 0 to n.
pdbx_auth_seq_align_beg# Initial position of the alignment in the PDB (submitted by the author). Numerical interger ranging from 0 to n.
pdbx_auth_seq_align_end# Final position of the alignment in the PDB (submitted by the author). Numerical interger ranging from 0 to n.
pdbx_strand_id# Strand identifier in PDB. Letter or numerical interger.
seq_align_beg# Initial position of the alignment in the PDB (after validation by the Protein Data Bank). Numerical interger ranging from 0 to n.
seq_align_end# Final position of the alignment in the PDB (after validation by the Protein Data Bank). Numerical interger ranging from 0 to n.
db_name# Database of validation. Example: UNP (UNIPROT).
pdbx_align_begin# Initial position of alignment between the PDB and the database in the PDB file. Numerical interger ranging from 0 to n.
len_seq# Size of the alignment sequence between PDB and Uniprot Numerical interger.
Blosum62 Score associated with the frequency of substitution of one amino acid for another according to the Blosum62 matrix. Numerical interger.
groupBefore Amino acid classification group before mutation. Possible values: 
nonpolar, positivecharge, polar, negativecharge, aromatic.
groupAfter Amino acid classification group after mutation. Possible values: 
nonpolar, positivecharge, polar, negativecharge, aromatic.
groupChange Amino acid classification group before mutation and after mutation. Format: 
Nonpolar Runner 
positivechargeTOpolar
aminBeforeEssential Classifies whether the amino acid before the mutation is essential or not. Possible values:  
1 (Essential Amino Acid) 
0 (Non-essential amino acid)
aminAfterEssential Classifies whether the amino acid after the mutation is essential or not. Possible values:  
1 (Essential Amino Acid) 
0 (Non-essential amino acid) 
- (Amino Acid Free)
essencialChange Essential amino acid before mutation and after mutation. Possible values: 
0TO1,  
0TO0,  
1TO1
0TO-,  
1TO0,  
1TO-.
substitution Classifies whether the DNA replacement that occurred in the mutation was a transition or a transversion. Possible values: 
1 (transversion) 
0 (transition)
NodeId_RING The node of the interaction can be an amino acid or a binding molecule. Contains the following information: Chain, Node Position, and the Node. 
Ligands were not included in the AlphaFold2 structures.
Format: 
A:61:_:GLN
Dssp_RING Secondary structure calculated with the DSSP algorithm (Included in the RING tool).
Possible values:
H: 4-turn helix (alpha helix). Minimum length 4 residues. 
E: extended strand in parallel and/or anti-parallel beta sheet conformation. Minimum length 2 residues .
S: bend (the only non-hydrogen bond based assignment). 
T: hydrogen bonded turn (3, 4 or 5 turn). 
G: 3-turn helix (3-10 helix). Minimum length 3 residues.
B: residue in isolated β-bridge (single pair beta sheet hydrogen bond formation).
I: 5-turn helix (π helix). Min. length 5 residues.
Degree_RING Number of interactions of the node (without mutation). Numerical interger.
Bfactor_CA_RING alpha-carbon B-factor (experimental structures only). Numerical interger.
pLDDT_Global* The mean predicted local distance difference test (pLDDT) for all residues in a structure (Only for AlphaFold2 structures). Numerical values ranging from 0 (worse) to 100 (better).
pLDDT_RING* The predicted local distance difference test (pLDDT), per-residue, calculated by AlphaFold2 (Only for AlphaFold2 structures). Numerical values ranging from 0 (worse) to 100 (better).
F_AF* AlphaFold fragment identificator. In proteins with more than 2700 residues, the program analyzes overlapping fragments of 1400 residues. Each one is numbered, Fn (n = 1,2,3,...). Numerical interger.
Inter_Lig_tot# Total number of residue-ligand interactions. Numerical interger.
Inter_Res_tot Total number of residue-residue interactions. Numerical interger.
Inter_IAC_Lig_tot# Total number of IAC-type (Generic interaction) ligand interactions. Numerical interger.
Inter_VDW_Lig_tot# Total number of ligand van der Waals interactions. Numerical interger.
Inter_VDW_Res_tot# Total interactions between VDW residuals Numerical interger.
Inter_HBOND_Res_tot Total interactions between HBOND residuals Numerical interger.
Inter_PIPISTACK_Res_tot Total interactions between PIPISTACKK residuals Numerical interger.
Inter_IONIC_Res_tot Total interactions between IONIC-like residues Numerical interger.
Inter_SSBOND_Res_tot Total interactions between SSBOND residues Numerical interger.
Inter_PICATION_Res_tot Total interactions between PICATION residuals Numerical interger.
triangles_node Number of triangles that this node forms with other waste that it interacts with Numerical interger.
clusteringCoef_node Node clustering coefficient Numerical values ranging from 0 to 1.
betweennessWeighted_node node betweenness Numerical values ranging from 0 to 1.
Tissue Tissue where the mutation occurred 
Possible values: 
ACC (Adrenocortical carcinoma)
BLCA (Bladder Urothelial Carcinoma)
BRCA (Breast invasive carcinoma)
CESC (Cervical squamous cell carcinoma and endocervical adenocarcinoma) 
CHOL (Cholangiocarcinoma) 
COAD (Colon adenocarcinoma)
DLBC (Lymphoid Neoplasm Diffuse Large B-cell Lymphoma)
ESCA (Esophageal carcinoma)
GBM (Glioblastome multiform)
HNSC (Head and Neck squamous cell carcinoma)
Keech (kidney chromophobe)  
KIRC (Kidney renal clear cell carcinoma)
KIRP (Kidney renal papillary cell carcinoma)
LAML (Acute Myeloid Leukemia)
LGG (Brain Lower Grade Glioma)
LIHC (Liver hepatocellular carcinoma),
LUAD (Lung adenocarcinoma) 
LUSC (Lung squamous cell carcinoma) 
MESO (Mesothelioma)
OV (Ovarian serous cystadenocarcinoma)
PAAD (Pancreatic adenocarcinoma)
PCPG (Pheochromocytoma and Paraganglioma)
PRAD (Prostate adenocarcinoma)
READ (Rectum adenocarcinoma)
SARC (Sarcoma)
SKCM (Skin Cutaneous Melanoma)
STAD (Stomach adenocarcinoma)
TGCT (do inglês, Testicular Germ Cell Tumors)
THYM (Thymoma)
THCA (Thyroid carcinoma)
UCEC (Uterine Corpus Endometrial Carcinoma)
UCS (Uterine Carcinosarcoma)
UVM (Uveal Melanoma)
Tissue Abbreviation
Example:
LUAD

*Attributes only described on data calculated from AlphaFoldDB structures.

#Attributes only described on data calculated from RCSB PDB structures.

Ligands were not included in the AlphaFold2 structures.