Version 1.4 adds new machine learning algorithms, AI-powered feature engineering from geo-temporal data, and significant enhancements in automated data preprocessing and data collection
San Mateo, CA – March 12, 2019 – dotData, the first and only company focused on delivering end-to-end data science automation and operationalization for the enterprise, today announced the availability of Version 1.4 of its Data Science Automation Platform. This latest update adds significant enhancements to the platform and provides users with deeper insights, increased flexibility, ease-of-use, and greater performance to meet their specific business goals.
dotData will showcase its Enterprise Version 1.4 and dotDataPy Version 1.0 at the Gartner Data and Analytics Summit from March 18-21 in Orlando, Florida. The company will also present a session, “Why Do 85% of Enterprise Data Science Projects Fail?”, on Wednesday, March 20.
dotData’s AI-powered Data Science Automation Platform completely automates the entire data science process, from data collection through production-ready models, including feature engineering.
“One of the most exciting enhancements in Version 1.4 is the support of AI-powered feature engineering for geo-temporal data. These are very rich, but difficult-to-analyze data sources while providing deeper insights over a geographical spectrum,” said Ryohei Fujimaki, PhD, dotData’s CEO. “We’ve also made significant enhancements in the machine learning and data preparation components of the platform, giving enterprises the freedom to solve more data science challenges, faster.”
Key updates of the dotData Platform Version 1.4 include:
Feature Engineering from Geo-Temporal Data
- The ability to leverage geo-temporal data, such as GPS data, census information, and data from mobile devices, is growing in importance across many industries, including financial services, retail, and healthcare.
- For example, for a retail store, geo-temporal patterns such as “whether there is a sporting event within three miles of the store during the next week” are often very important in enabling the store to optimize its inventory. dotData Version 1.4 enables users to automatically design such geo-temporal features with a few clicks.
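To make the retail example concrete, the following is a minimal sketch of how a feature like “a sporting event within three miles of the store during the next week” could be computed by hand. It is not dotData's implementation: the function names, the record layout, and the sample coordinates are all hypothetical, and the haversine formula is simply one common way to measure distance between two GPS points.

```python
import math
from datetime import datetime, timedelta

EARTH_RADIUS_MILES = 3958.8  # mean Earth radius


def haversine_miles(lat1, lon1, lat2, lon2):
    """Great-circle distance between two (lat, lon) points, in miles."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_MILES * math.asin(math.sqrt(a))


def event_nearby_next_week(store, events, as_of, radius_miles=3.0, horizon_days=7):
    """True if any event starts within the time horizon AND within the radius."""
    window_end = as_of + timedelta(days=horizon_days)
    return any(
        as_of <= e["start"] < window_end
        and haversine_miles(store["lat"], store["lon"], e["lat"], e["lon"]) <= radius_miles
        for e in events
    )


# Hypothetical sample data, for illustration only.
store = {"lat": 37.5630, "lon": -122.3255}
as_of = datetime(2019, 3, 12)
near_soon = {"lat": 37.5800, "lon": -122.3300, "start": datetime(2019, 3, 15, 19, 0)}
far_soon = {"lat": 37.7749, "lon": -122.4194, "start": datetime(2019, 3, 15, 19, 0)}
near_late = {"lat": 37.5800, "lon": -122.3300, "start": datetime(2019, 3, 25, 19, 0)}

has_event = event_nearby_next_week(store, [far_soon, near_late, near_soon], as_of)
```

The feature combines a spatial condition (distance) with a temporal one (start time inside a window), which is what makes such features tedious to design manually across many radius and horizon choices.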
New State-of-the-Art Machine Learning Algorithms
- dotData Version 1.4 now supports more state-of-the-art machine learning algorithms, including Gradient Boosting (XGBoost, LightGBM), Random Forest, and others. The Platform automatically tunes the hyper-parameters of these algorithms to achieve the best performance across various statistical metrics.
- dotData users can automatically take advantage of these highly-accurate ML algorithms, in addition to previous white-box algorithms, to improve model accuracy.
Enhanced Automatic Data Preprocessing
- dotData Version 1.4 significantly enhances data preprocessing on both source data and features, including data integration, source data cleansing, and feature outlier filters, in addition to preprocessing functionalities supported in previous versions such as missing value imputation and data normalization.
- This data preprocessing is fully automated, expanding the range of automation and further freeing up data scientists to focus on the highest value projects with the biggest impact.
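As a rough illustration of the preprocessing steps named above (missing value imputation, outlier filtering, and normalization), here is a minimal standard-library sketch. The release does not describe which methods the Platform uses, so median imputation, an interquartile-range filter, and min-max scaling are assumptions chosen for simplicity, and all function names are hypothetical.

```python
import statistics


def impute_missing(values):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in values if v is not None]
    fill = statistics.median(observed)
    return [fill if v is None else v for v in values]


def filter_outliers(values, k=1.5):
    """Keep only values within k * IQR of the first and third quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


def min_max_normalize(values):
    """Rescale values linearly to the range [0, 1]."""
    low, high = min(values), max(values)
    if high == low:
        return [0.0 for _ in values]  # constant column: nothing to scale
    return [(v - low) / (high - low) for v in values]


# Hypothetical feature column with one missing value and one extreme value.
raw = [10.0, None, 12.0, 11.0, 500.0, 9.0]
imputed = impute_missing(raw)
filtered = filter_outliers(imputed)
normalized = min_max_normalize(filtered)
```

The order matters in this sketch: imputing first keeps the row count stable, and filtering before scaling prevents a single extreme value from compressing every other value toward zero.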
Drag-and-Drop Data Collection
- dotData Version 1.4 supports drag-and-drop data collection from CSV files in addition to existing JDBC data connectors. This enables users to import their locally-customized data quickly without handling SQL or interacting with databases.
The dotData Platform accelerates the entire data science process from months to days, enabling companies to rapidly scale their AI/ML initiatives to drive transformative business changes. The dotData Platform also democratizes the data science process by enabling more participants with different skill levels to effectively execute on projects, making it possible for enterprises to operationalize 10x more projects with transparent and actionable outcomes.
dotData will be exhibiting and conducting demos in booth #637 at the Gartner Data and Analytics Summit, March 18-21 at the Orlando World Center Marriott. Vice President of Data Science Aaron Cheng will present at the Summit in a session, “Why Do 85% of Enterprise Data Science Projects Fail?”, on Wednesday, March 20, from 2:15pm – 3:00pm. dotData will also participate in the Gartner Analytic Show Floor Showdown, a series of sessions on the show floor where machine learning vendors will demonstrate their products. dotData’s session will take place on Wednesday, March 20, from 11:30am – 12:30pm.
If you are interested in meeting with dotData at the Gartner Summit, contact [email protected]. For more information or a demo of dotData’s AI-powered Data Science Automation Platform, please visit dotData.com.
dotData is the first and only company focused on delivering end-to-end data science automation for the enterprise. dotData’s fully-automated data science platform speeds time to value by accelerating, democratizing and operationalizing the entire data science process, from raw data ingestion through feature engineering to ML models in production. dotData is delivering new levels of speed, scale and value in successful deployments across multiple industries, including several Fortune Global 250 clients. For more information, visit dotData.com.
Zer0 to 5ive for dotData