Detailed schema of UniProt in mBioSQL
Notes:
- The db_info table contains mata-data about the database.
- itype: 'cright' for copyright or 'res' for result entires;
- rtype: if itype=='res' then this is 'fuzzpro' or 'restrict';
- name: 'COPYRIGHT' in case of 'cright; while user defined when running the br_load.py script;
- descr: The copyright text or the description of the result table (user defined).
- The prots table is the main table; fields correspond to the appropriate flat file tags. fid primary key is defined as it is an integer type compared to the UniProt ID (char(12)) resulting in faster lookup of a protein.
- Tables ccs, dbx, fts, refx, kwx are tables implementing One-to-Many/Many-to-Many/Many-to-One relationships. The cc_ref, db_ref, ft_ref, kw_ref store controlled vocalbularies of UniProt (e.g. keywlst.txt, dbxref.txt)
- The result_table contains fuzzpro (EMBOSS) search reasults.