Dataset at https://github.com/HannesStark/protein-localization/tree/master/data_files
License: MIT
Provenance: UniProt data (Creative Commons Attribution 4.0 International (CC BY 4.0)). Manual curation by DeepLoc and Light attention authors.
Attributed to:
The UniProt Consortium. UniProt: the Universal Protein Knowledgebase in 2025. Nucleic Acids Res. 53:D0-D0 (2025)
José Juan Almagro Armenteros, Casper Kaae Sønderby, Søren Kaae Sønderby, Henrik Nielsen, Ole Winther, DeepLoc: prediction of protein subcellular localization using deep learning, Bioinformatics, Volume 33, Issue 21, November 012017, Pages 3387–3395, https://doi.org/10.1093/bioinformatics/btx431
Hannes Stärk, Christian Dallago, Michael Heinzinger, Burkhard Rost, Light attention predicts protein location from the language of life, Bioinformatics Advances, Volume 1, Issue 1, 2021, vbab035, https://doi.org/10.1093/bioadv/vbab035