It's been another month full of community goodness and we are grateful! Let's get right to it!
Interested in versioning your data AND your notebooks?
Steve Purves CTO and co-founder of Curvenote gave a three-part tutuorial on integrating DVC and Curvenote for creating reproducible, collaborative version control for data scientists. The videos are beginner accessible with tips for intermediate git users. Access the videos here.
DVC and Curvenote for the version control win!
Next up, Puneeth Pai of Thoughtworks wrote a two-part blog series with a how-to for achieving continuous machine learning using DVC pipelines with Jenkins and DAGsHub. Quoted in the article is our own David Ortega,
Treating experiments like potential new features in a software project opens up many possibilities for improving our engineering practices.
Check out these posts at the link below or catch Puneeth at our next Meetup where he will be giving a high level overview of this content as well as answering questions.
Our Discord server has exploded since last month, up 30% in membership 😱, thanks in large part to a Towards Data Science post from Sara Metwalli recommending 9 Discord Servers for Math, Python, and Data Science You Need to Join Today.
Sara encourages readers to connect, learn and get inspired. 🚀 Thanks Sara! We're on board with that! Rest assured our growing team is hard at work creating content, improving tools and working on new tools 😶🤗 to continue to grow and serve our MLOps community!
Andrew NG of Deep Learning AI and Coursera fame has just released a new course specializing in MLOps, called Machine Learning Engineering for Production (MLOps) Specialization. The course "combines the foundational concepts of machine learning with the functional expertise of modern software development and engineering roles." Methodologies and capabilities of MLOps are introduced while addressing the challenges and consequences of machine learning engineering in production. I'm signed up! 🙋🏻♀️ How 'bout you?
Next for your learning pleasure, Simone Scardapane is in the process of fulfilling a "personal challenge" to create a PhD course for Reproducible Deep Learning that includes the use of open source tools including our own DVC! Head to the link to star the repo and cheer him on. We will be! 🙌🏼
Reproducible Deep Learning PhD Course
Simone Scardapane is in the process of fulfilling a "personal challenge" to create a PhD course for Reproducible Deep Learning that includes the use of open source tools including our own DVC! Head to the link to star the repo and cheer him on. We will be! 🙌🏼
You see what I did there, right? Reproducible… Deep Learning…
Get it? Layers of wit people. I learned from the best! Just wanted to make sure you were paying attention!
We've hit 30 team members! Our team is distributed all over the world and has grown so much that we now have two all-hands meetings! Affectionately called UTC + and UTC -, these meetings honor all our different time zones while allowing the other group to watch via recording when they are awake! You know we're all about solving complicated problems. 💪🏼
Svetlana Sachkovskaya is originally from Belarus, but is currently living in Poland. She has been a full stack developer for over seven years. She loves traveling, meeting new people and is excited to work on open source software. In her spare time you may find her dancing the tango! 💃🏻 Welcome Sveta!
Exemplifying our diverse team in one fell swoop, Casper da Costa-Luis has lived in three continents. He has been working on DVC for a couple of years and is a long-standing contributor to open source. He now joins us on the CML & Docs teams after completing his PhD in Medical Imaging. Fun facts about Casper include his becoming the U18 chess champion of Kenya when he was 14 and being a qualified SCUBA diver. 🤿
Emre Şahin joins us on the DVC team as a technical writer/ML enthusiast/AI dreamer/tutorial builder from Instanbul, Turkey. A self-described zealot for technologies, Emre has worked in many development/ML related projects and has been programming in Python since v. 1.7. We are excited for Emre to bring you excellent technical content! ✍🏼
Tapa Dipti Sitaula joins us as a Senior Product Engineer from Nepal. She previously worked as a Principal Engineer at a tech start up in India and has worked in various capacities in her career from engineering to project management and communications. Her interests include learning languages and breaking gender stereotypes. We're right there with you Tapa! 🚀
And we're still hiring!
Check out our three open roles for:
Does this sound like you or someone you know? Be in touch!
ML Repa Week took place last month and team members gave three great talks. Dmitry Petrov gave a talk on data versioning and machine learning experiments on top of Git. Elle O'Brien gave a talk on automating machine learning with Github action and GitLab CI. And finally, Mikhail Rozhkov gave a talk on setting up the workflow for machine learning batch scoring applications using DVC, MLflow and Airflow. Be sure to check out all three talks and other great talks from the week long Conference.
Don't miss our Meetup May 27th at 3:00pm UTC, where we will hear from Puneeth Pai as mentioned above 👆🏽, as well as another user putting DVC and CML into action on his team, and finally from David Ortega discussing CML pull requests! Bring your questions! We're here to help!
🦉 I'm really enjoying reading through @DVCorg's documentation and use cases for operationalizing machine learning models.https://t.co/9k8tSfXbMK— 👩💻 Paige Bailey (@DynamicWebPaige) May 17, 2021
If you've ever tried to put a model in production, these concepts will resonate. Check out their open-source project on @Github! ✨ pic.twitter.com/zsSdlivwZk
That's quite a shout out! Thanks to Jorge Orpinel and team for always raising the bar on our docs! Until next month! 👩🏽💻
Do you have any use case questions or need support? Join us in Discord!
Head to the DVC Forum to discuss your ideas and best practices.