Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling M Zaharia, D Borthakur, J Sen Sarma, K Elmeleegy, S Shenker, I Stoica Proceedings of the 5th European conference on Computer systems, 265-278, 2010 | 2030 | 2010 |
The hadoop distributed file system: Architecture and design D Borthakur Hadoop Project Website, 2007 | 1704 | 2007 |
vTPM: virtualizing the trusted platform module R Perez, R Sailer, L van Doorn Proc. 15th Conf. on USENIX Security Symposium, 305-320, 2006 | 1080* | 2006 |
Xoring elephants: Novel erasure codes for big data M Sathiamoorthy, M Asteris, D Papailiopoulos, AG Dimakis, R Vadali, ... arXiv preprint arXiv:1301.3791, 2013 | 930 | 2013 |
HDFS architecture guide D Borthakur Hadoop apache project 53 (1-13), 2, 2008 | 888 | 2008 |
Apache hadoop goes realtime at facebook D Borthakur, J Gray, JS Sarma, K Muthukkaruppan, N Spiegelberg, ... Proceedings of the 2011 ACM SIGMOD International Conference on Management of …, 2011 | 714 | 2011 |
Data warehousing and analytics infrastructure at facebook A Thusoo, Z Shao, S Anthony, D Borthakur, N Jain, J Sen Sarma, ... Proceedings of the 2010 ACM SIGMOD International Conference on Management of …, 2010 | 614 | 2010 |
DeTail: Reducing the flow completion time tail in datacenter networks D Zats, T Das, P Mohan, D Borthakur, R Katz Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies …, 2012 | 525 | 2012 |
Job scheduling for multi-user mapreduce clusters M Zaharia, D Borthakur, JS Sarma, K Elmeleegy, S Shenker, I Stoica EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS …, 2009 | 495 | 2009 |
{PACMan}: Coordinated memory caching for parallel jobs G Ananthanarayanan, A Ghodsi, A Warfield, D Borthakur, S Kandula, ... 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12 …, 2012 | 437 | 2012 |
Linkbench: a database benchmark based on the facebook social graph TG Armstrong, V Ponnekanti, D Borthakur, M Callaghan Proceedings of the 2013 ACM SIGMOD International Conference on Management of …, 2013 | 411 | 2013 |
A solution to the network challenges of data recovery in erasure-coded distributed storage systems: A study on the Facebook warehouse cluster KV Rashmi, NB Shah, D Gu, H Kuang, D Borthakur, K Ramchandran 5th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 13), 2013 | 371 | 2013 |
A" hitchhiker's" guide to fast and efficient data reconstruction in erasure-coded data centers KV Rashmi, NB Shah, D Gu, H Kuang, D Borthakur, K Ramchandran Proceedings of the 2014 ACM conference on SIGCOMM, 331-342, 2014 | 301 | 2014 |
Energy efficiency for large-scale mapreduce workloads with significant interactive analysis Y Chen, S Alspaugh, D Borthakur, R Katz Proceedings of the 7th ACM european conference on Computer Systems, 43-56, 2012 | 259 | 2012 |
Optimizing Space Amplification in RocksDB. S Dong, M Callaghan, L Galanis, D Borthakur, T Savor, M Strum CIDR 3, 3, 2017 | 257 | 2017 |
{FATE} and {DESTINI}: A framework for cloud recovery testing HS Gunawi, T Do, P Joshi, P Alvaro, JM Hellerstein, AC Arpaci-Dusseau, ... 8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11), 2011 | 125 | 2011 |
HDFS architecture D Borthakur Document on Hadoop Wiki. URL http://hadoop. apache. org/common/docs/r0 20, 2010 | 103 | 2010 |
Failure as a service (faas): A cloud service for large-scale, online failure drills HS Gunawi, T Do, JM Hellerstein, I Stoica, D Borthakur, J Robbins University of California, Berkeley, Berkeley 3, 2011 | 59 | 2011 |
Hdfs raid D Borthakur, R Schmidt, R Vadali, S Chen, P Kling Hadoop User Group Meeting, 2010 | 59 | 2010 |
Job scheduling for multi-user mapreduce clusters. EECS Department M Zaharia, D Borthakur, JS Sarma, K Elmeleegy, S Shenker, I Stoica University of California, Berkeley, Tech. Rep. UCB/EECS-2009-55, Apr, 2009-55, 2009 | 56 | 2009 |