Scikit-learn 1.0 Supports Spline Transformers, Quantile Regression and Improved Plotting API – InfoQ.com

InfoQ Live October 19: How to apply Microservices and DevSecOps to improve application maintainability, security & deployment speed. Register Now
Facilitating the spread of knowledge and innovation in professional software development


In this article, we’ll explore the benefits of using blockchain for business solutions, describing the differences between public and private versions of this technology in practice. We’ll also talk about a new type of chain — a hybrid of private and public chains which takes the benefits of both to create a truly versatile platform with no compromises.
The uptake of low code software is so strong that it will almost certainly make its way into your organization. Most software engineers shouldn’t be concerned about this because they are good at the things that low code software is not yet good at. The key to surviving and thriving during this change is ensuring that your role encompasses responsibilities that low code can’t do yet.
In the podcast, Rosaria Silipo talks about the emerging trends in deep learning, with focus on low code visual programming to help data scientists apply deep learning techniques without having to code the solution from scratch.
Scrum is easy to explain and hard to do well. The majority of Scrum Teams struggle to do Scrum well. The OMG Essence standard promises to make practices more accessible and to free them from the tyranny of formal methods and frameworks. This article explains how Essence Scrum practices produced by Ian Spence and Dr Jeff Sutherland can help your teams get better at Scrum regardless of the context
AI tools are slowly replacing the role of the developer – just as DevOps did before – and will eventually supplant DevOps entirely. Assessing whether that prediction is true is tricky. In this article, we’ll look at what AI promises for the development process, assess whether it can really ever take over from human developers, and what DevOps is likely to look like in a decades’ time.
Learn how to apply Microservices and DevSecOps to improve application security & deployment speed. Virtual Event on Oct 19th, 9AM EDT/ 3PM CEST
Turn advice from 64+ world-class professionals into immediate action items. Attend online on Nov 1-12.
Learn from practitioners driving innovation and change in software. Attend in-person on April 4-6, 2022.
InfoQ Homepage News Scikit-learn 1.0 Supports Spline Transformers, Quantile Regression and Improved Plotting API
Oct 12, 2021 2 min read
by
Anthony Alford
Scikit-learn, the popular Python-based machine learning (ML) library, has released version 1.0. Although the library has been stable for some time, and the release contains no breaking changes, the project maintainers opted for a major version revision to signal to users that the software is mature and production-ready.
The project team announced the release on Twitter. Containing 2,100 merged pull requests since the previous 0.24 release, version 1.0 contains several new features, including spline transformers, quantile regression, online one-class support vector machines (SVM), and an improved plotting API. There are also many documentation improvements, representing nearly 800 of the merged pull requests. Although there are no breaking changes, apart from those in the project's normal two-release deprecation cycle, the team decided to increment the library's major version number from 0 to 1 in recognition of the code's long-term stability and maturity.  According to Adrin Jalali, a core developer on the project:
The library has been stable for a while, and we'd like to signal that by the versioning of the release….[It] includes some features which we've wanted to have for years, so it felt right to finally do it!
Scikit-learn, billed as an "easy-to-use and general-purpose machine learning in Python," is used by over 80% of data scientists, according to Kaggle's 2020 survey. The library contains implementations of many common ML algorithms and models, including the widely-used linear regression, decision tree, and gradient-boosting algorithms. Begun in 2007 as a Google Summer of Code project, it was originally conceived as an ML "toolkit" for the Python-based scientific computing library SciPy. Scikit-learn's first public beta release was in early 2010, and in 2020 the library was accepted as a Sponsored Project by NumFOCUS, the non-profit foundation that funds SciPy and many other open-source scientific computing packages.
Several new features were included in the release. One important change is that constructor and function parameters are required to be keyword arguments instead of positional. Existing histogram-based gradient boosting models have moved from experimental to stable status, and there are also new models. First, the SGDOneClassSVM model is a linear version of the One-Class SVM that is fit using stochastic gradient descent (SGD). This can approximate the solution of a kernelized One-Class SVM with "several orders of magnitude faster" time to fit. Quantile regression models can estimate the median or other quantiles of a function; the model is fit by minimizing the pinball loss.
In a discussion about the release on Hacker News, some users noted that scikit-learn still is not a good choice for deep learning models:
– No saving checkpoints (can be crucial for large models who need a lot of compute and time)
– No way to assign different activation functions to different layers
– No complex nodes like LSTM, GRU
– No way to implement complex architectures like transformers, encoders etc
Other users also pointed out that scikit-learn does not support GPU hardware. However, most users praised the library for having good documentation and being easy to use:
scikit-learn (next to NumPy) is the one library I use in every single project at work. Every time I consider switching away from Python I am faced with the fact that I'd lose access to this workhorse of a library. Of course it's not all sunshine and rainbows – I had my fair share of rummaging through its internals – but its API design is a de-facto standard for a reason.
The scikit-learn code is available on GitHub.
 

Redis Enterprise is an in-memory database platform built by the people who develop open-source Redis. Get Started.
A round-up of last week’s content on InfoQ sent out every Tuesday. Join a community of over 250,000 senior developers. View an example

We protect your privacy.
You need to Register an InfoQ account or or login to post comments. But there’s so much more behind being registered.
Get the most out of the InfoQ experience.
Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p
by Menito Bussolini,
by Menito Bussolini,
Your message is awaiting moderation. Thank you for participating in the discussion.
Please stop quoting HN. They’re a very limited group of people who don’t represent the scikit-learn philosophy and scope.

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

A round-up of last week’s content on InfoQ sent out every Tuesday. Join a community of over 250,000 senior developers. View an example

We protect your privacy.
Focus on the topics that matter in software development right now.
Deep-dive with 64+ world-class software leaders. Discover how they are applying emerging trends. Learn their use cases and best practices.
Stay ahead of the adoption curve and shape your roadmap with QCon Plus online software development conference.
InfoQ.com and all content copyright © 2006-2021 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we’ve ever worked with.
Privacy Notice, Terms And Conditions, Cookie Policy

source
Connect with Chris Hood, a digital strategist that can help you with AI.

Leave a Reply

Your email address will not be published. Required fields are marked *

© 2021 AI Caosuo - Proudly powered by theme Octo