fast_protein_cluster: parallel and optimized clustering of large-scale protein modeling data

Hung, Ling-Hong; Samudrala, Ram

doi:10.1093/bioinformatics/btu098

Feb 14 2014

Source: Bioinformatics. 2014; 30(12):1774-1776.

Details:

Alternative Title:

Bioinformatics

Personal Author:

Hung, Ling-Hong ; Samudrala, Ram

Description:

fast_protein_cluster is a fast, parallel and memory-efficient package used to cluster 60 000 sets of protein models (with up to 550 000 models per set) generated by the Nutritious Rice for the World project.

fast_protein_cluster is an optimized and extensible toolkit that supports Root Mean Square Deviation after optimal superposition (RMSD) and Template Modeling score (TM-score) as metrics. RMSD calculations using a laptop CPU are 60× faster than qcprot and 3× faster than current graphics processing unit (GPU) implementations. New GPU code further increases the speed of RMSD and TM-score calculations. fast_protein_cluster provides novel k-means and hierarchical clustering methods that are up to 250× and 2000× faster, respectively, than Clusco, and identify significantly more accurate models than Spicker and Clusco.

fast_protein_cluster is written in C++ using OpenMP for multi-threading support. Custom Streaming SIMD Extensions (SSE) and Advanced Vector Extensions (AVX) intrinsics code accelerates CPU calculations, and OpenCL kernels support AMD and Nvidia GPUs. fast_protein_cluster is available under the MIT license at http://software.compbio.washington.edu/fast_protein_cluster
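For readers unfamiliar with the two metrics named in the description, the following is a minimal sketch of how each is evaluated for a pair of models whose coordinates have already been superposed. It is not code from fast_protein_cluster: the function names `rmsd` and `tm_score` are hypothetical, the search for the optimal superposition is omitted, and the 0.5 Å floor on the distance scale `d0` is an assumption of this sketch to keep the scale positive for short chains. The TM-score form used is the standard one, the mean over residues of 1 / (1 + (d_i / d0)^2) with d0 = 1.24 (L - 15)^(1/3) - 1.8.

```python
import math


def _distance(u, v):
    """Euclidean distance between two (x, y, z) points."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(u, v)))


def rmsd(a, b):
    """RMSD between two equal-length lists of (x, y, z) points.

    Assumes the two models are already superposed; no rotation or
    translation is applied here.
    """
    if len(a) != len(b) or not a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = sum(_distance(u, v) ** 2 for u, v in zip(a, b))
    return math.sqrt(total / len(a))


def tm_score(a, b, target_length=None):
    """TM-score of already-superposed models, normalized by target_length.

    target_length defaults to the number of aligned points. The 0.5 floor
    on d0 is an assumption of this sketch for chains of 15 residues or fewer.
    """
    if len(a) != len(b) or not a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    n = target_length if target_length is not None else len(a)
    d0 = 1.24 * (n - 15) ** (1.0 / 3.0) - 1.8 if n > 15 else 0.5
    d0 = max(d0, 0.5)
    return sum(1.0 / (1.0 + (_distance(u, v) / d0) ** 2) for u, v in zip(a, b)) / n


if __name__ == "__main__":
    model = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
    shifted = [(x + 1.0, y, z) for x, y, z in model]
    print(rmsd(model, shifted), tm_score(model, shifted))
```

RMSD grows without bound as models diverge and is dominated by the worst-fitting residues, whereas each TM-score term lies between 0 and 1, so a few large deviations cannot dominate the total. A clustering tool would evaluate one of these metrics for many pairs of models, which is why the per-pair cost matters at the scale the description mentions.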

Source:

Bioinformatics. 2014; 30(12):1774-1776.

Pubmed ID:

24532722

Pubmed Central ID:

PMC4058946

Document Type:

Journal Article

Funding:

DP1 LM011509/LM/NLM NIH HHS/United States ; DP1LM011509/DP/NCCDPHP CDC HHS/United States

Collection(s):

CDC Public Access

Download URL:

https://stacks.cdc.gov/view/cdc/25595/cdc_25595_DS1.pdf

File Type:

	btu098.nxml	txt
	btu098f1.gif	gif
	btu098f1.jpg	jpeg
	license.txt	txt
