Linda Zhou

Data Hut May 2021 Update

We are pleased to announce the May update to DataHut, our curated site with analysis on popular data science and data engineering projects. We have added two new categories: Time Series Analysis and Workflow Management. Time Series Analysis is a sub-category of Data Science. It includes projects which perform feature extraction, prediction, trend analysis, and machine learning on time series data. We are tracking four projects in this category. Workflow… Read More »Data Hut May 2021 Update

Ray’s Ecosystem

As part of our blog series on Ray, this post analyzes the ecosystem that Ray has built around its platform. If you missed our first blog post on Ray, you might read it first.   Ray is a relatively young open source project, created in 2016 as part of the research project from UC Berkeley. Nevertheless, Ray has created an impressive ecosystem around its platform. The graph below shows the… Read More »Ray’s Ecosystem

Ray: A Distributed Computing Platform For Machine Learning

Ray is an open source project originating from the UC Berkeley RISELab in 2016. The creators of Ray launched a commercial company, Anyscale, in 2019. The Ray project has been a superstar from its inception: it received two NSF grants and sponsorships from Alibaba, Amazon Web Services, Ant Financial, ARM, CapitalOne, Ericsson, Facebook, Google, Huawei, Intel, Microsoft, Sco-tiabank, Splunk, and VMware. Without any surprise, Anyscale successfully raised $60M from two… Read More »Ray: A Distributed Computing Platform For Machine Learning

Gitee: China’s Official Open Source Repository

This is the third installment of our blog series covering open source in China. If you missed the first one or the second one, you can read these two blogs first to provide the context. In this post, we look at Gitee, which went from an obscure Git hosting service, to China’s official open source repository. Fast Download for China Gitee was launched 7 years ago by the early open… Read More »Gitee: China’s Official Open Source Repository

Open Source Models in China

This is the second installment of our blog series covering open source in China. If you missed the first one, here is the link. In this post, we look at three contribution models used in the Chinese open source community: contributing to existing projects, financial backing of existing projects, and projects created in China. Contributing to Existing Projects Just look at the CNCF alone, Chinese companies together are the third… Read More »Open Source Models in China

China is one of the largest open source contributors in the world!

Open Source usage in China has evolved from a pure consuming model to a consuming and contributing model. According to GitHub, over one million Chinese developers contributed to open source projects in 2019 and China was the second largest user of open source. Furthermore, Cloud Native Computing Foundation (CNCF) stated that “China is the third largest contributor to CNCF projects (in terms of contributors and committers) after the U.S. and… Read More »China is one of the largest open source contributors in the world!

Open Source Licenses From A Consumer Perspective

There are many articles discussing open source licenses and business models targeting investors or open source creators. If you are interested in these topics, I recommend that you read Forbes and Medium. I will not repeat various licenses from investors’ perspectives. Rather, I would like to focus on how you should evaluate open source licenses from a user’s perspective. See the above open source selection flowchart.  If you want to… Read More »Open Source Licenses From A Consumer Perspective