Parallel algorithm to convert dense to sparse (CRS) matrix format
I need to convert a dense matrix to compressed row storage (CRS) format. I can do it with a serial algorithm, but that takes too long for large matrices. I am looking for a shared-memory parallel algorithm to do this. Has anyone seen one?
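
For context, here is a minimal sketch of one common shared-memory approach, not necessarily the fastest one. It assumes a row-major dense matrix of doubles and treats exact 0.0 as "empty". The scheme has two passes: count the nonzeros per row in parallel, turn the counts into row offsets with a prefix sum, then let each row fill its own disjoint slice of the output in parallel. The names `Crs`, `for_row_blocks`, and `dense_to_crs` are made up for illustration, and plain `std::thread` stands in for whatever threading layer (OpenMP, TBB, ...) is actually in use.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// CRS triple: nonzero values, their column indices, and row offsets.
struct Crs {
    std::vector<double> values;
    std::vector<std::size_t> col_idx;
    std::vector<std::size_t> row_ptr;  // size rows + 1
};

// Hypothetical helper: run fn(first_row, last_row) on contiguous row
// blocks, one block per thread, and wait for all of them to finish.
template <class Fn>
void for_row_blocks(std::size_t rows, unsigned num_threads, Fn fn) {
    std::vector<std::thread> workers;
    const std::size_t chunk = (rows + num_threads - 1) / num_threads;
    for (unsigned t = 0; t < num_threads; ++t) {
        const std::size_t first = std::min(rows, t * chunk);
        const std::size_t last = std::min(rows, first + chunk);
        if (first < last) workers.emplace_back(fn, first, last);
    }
    for (auto& w : workers) w.join();
}

// Assumes `dense` is row-major with rows * cols entries.
Crs dense_to_crs(const std::vector<double>& dense, std::size_t rows,
                 std::size_t cols, unsigned num_threads = 4) {
    assert(dense.size() == rows * cols && num_threads > 0);
    Crs out;
    out.row_ptr.assign(rows + 1, 0);

    // Pass 1 (parallel): store the nonzero count of row i in row_ptr[i + 1].
    for_row_blocks(rows, num_threads, [&](std::size_t first, std::size_t last) {
        for (std::size_t i = first; i < last; ++i) {
            std::size_t n = 0;
            for (std::size_t j = 0; j < cols; ++j)
                if (dense[i * cols + j] != 0.0) ++n;
            out.row_ptr[i + 1] = n;
        }
    });

    // Serial prefix sum turns counts into offsets; only O(rows) work.
    for (std::size_t i = 0; i < rows; ++i) out.row_ptr[i + 1] += out.row_ptr[i];

    out.values.resize(out.row_ptr[rows]);
    out.col_idx.resize(out.row_ptr[rows]);

    // Pass 2 (parallel): each row writes only to its own slice, so no locks.
    for_row_blocks(rows, num_threads, [&](std::size_t first, std::size_t last) {
        for (std::size_t i = first; i < last; ++i) {
            std::size_t k = out.row_ptr[i];
            for (std::size_t j = 0; j < cols; ++j) {
                const double v = dense[i * cols + j];
                if (v != 0.0) { out.values[k] = v; out.col_idx[k] = j; ++k; }
            }
        }
    });
    return out;
}
```

The dense matrix is read twice, so the conversion is likely memory-bandwidth bound and may not scale linearly with the number of threads. Splitting rows into equal-sized blocks also assumes the nonzeros are spread fairly evenly across rows.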

