Publications
Found 120 results
(2011). CudaDMA: Optimizing GPU Memory Bandwidth via Warp Specialization.
Intertantional Conference on Super Computing, SC'11. Abstract
(2011). Bringing Parallel Performance to Python with Domain-Specific Selective Just-in-Time Specialization.
Python for Scientific Computing Conference 2011. Abstract
(2010). Advances in the Parallelization of Music and Audio Applications.
Proceedings of the International Computer Music Conference (2010). Abstract
(2010). Opportunities and Challenges of Parallelizing Speech Recognition.
Second USENIX Workshop on Hot Topics in Parallelism (HotPar 2010). Abstract
(2010). Specifying and Verifying Sparse Matrix Codes.
The 15th Annual ACM SIGPLAN International Conference on Functional Programming (ICFP 2010).
(2010). A Case for FAME: FPGA Architecture Model Execution.
International Symposium on Computer Architecture (ISCA-2010). Abstract
(2010). Composing Parallel Software Efficiently with Lithe.
Programming Language Design and implementation (PLDI-2010). Abstract
(2010). RAMP Gold: An FPGA-based Architecture Simulator for Multiprocessors.
Design Automation Conference (DAC-2010). Abstract
(2010). Resource Management in the Tessellation Manycore OS.
2nd USENIX Workshop on Hot Topics in Parallelism (HotPar '10). Abstract
