dotData AutoML 2.0

AutoML 2.0 & Data Science Automation Platform

dotData is the world’s first and only Full-Cycle Data Science Automation Platform. It helps you automate the entire Enterprise AI and Machine Learning workflow. Unlike traditional AutoML platforms, dotData automates the time-consuming data wrangling and feature engineering parts of the AI process, accelerating projects to days instead of months. Data ingestion and wrangling, automated feature engineering, AutoML, and model operationalization are all automated with dotData Enterprise.

dotData Enterprise was designed to provide point-and-click access to Automated Machine Learning for your data-savvy professionals. With support for complex table relationships and the ability to perform computations on billions of rows of data, dotData provides the scalability and performance you need to create complex Enterprise AI models in record time.

dotData begins by providing automation features that help you connect to and prepare your data. dotData connects to your data and generates the required schemas to interpret table relationships. dotData goes beyond traditional AutoML features like raw data cleansing and data partitioning, adding unique entity relationship extraction and automated table joins, aggregations, and groupings to accelerate feature engineering.
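
To make the idea of automated joins and aggregations concrete, here is a minimal, library-free Python sketch of the same steps done by hand. It is not dotData's implementation or API: the two tables, their column names, and the `aggregate_features` helper are hypothetical and serve only to illustrate joining a transaction table to a customer table and aggregating per customer.

```python
from collections import defaultdict

# Hypothetical source tables: one row per customer, many rows per transaction.
customers = [
    {"customer_id": 1, "segment": "retail"},
    {"customer_id": 2, "segment": "business"},
]
transactions = [
    {"customer_id": 1, "amount": 20.0},
    {"customer_id": 1, "amount": 40.0},
    {"customer_id": 2, "amount": 100.0},
]


def aggregate_features(customers, transactions):
    """Join transactions to customers and derive count, sum, and mean features."""
    # Group transaction amounts by the join key.
    amounts = defaultdict(list)
    for row in transactions:
        amounts[row["customer_id"]].append(row["amount"])

    features = []
    for customer in customers:
        values = amounts.get(customer["customer_id"], [])
        total = sum(values)
        features.append({
            **customer,
            "txn_count": len(values),
            "txn_total": total,
            "txn_mean": total / len(values) if values else 0.0,
        })
    return features


feature_table = aggregate_features(customers, transactions)
```

Each additional table, join key, and aggregation multiplies the number of candidate features, which is why this step is a natural target for automation.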

dotData automatically generates features from different types of source business data with complex relations. With support for transactional, text, geo-location, temporal, and time-series data, dotData’s automated feature engineering analyzes millions of possible features and exposes the ones that are most likely to provide value to your organization, saving your company time and giving you the ability to uncover hidden insights.
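
As a rough illustration of what "exposing the most promising features" can mean, the sketch below ranks candidate features by their absolute Pearson correlation with a target. This is a deliberately simple stand-in and an assumption on our part, not the scoring method dotData uses. The feature names, the sample values, and the `rank_features` helper are all hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def rank_features(candidates, target, top_k):
    """Return the top_k feature names by absolute correlation with the target."""
    ranked = sorted(
        candidates,
        key=lambda name: abs(pearson(candidates[name], target)),
        reverse=True,
    )
    return ranked[:top_k]


# Hypothetical candidate features and a binary target.
candidates = {
    "txn_total": [10.0, 20.0, 80.0, 90.0],
    "days_since_signup": [3.0, 1.0, 4.0, 2.0],
    "constant_flag": [1.0, 1.0, 1.0, 1.0],
}
target = [0.0, 0.0, 1.0, 1.0]

top_features = rank_features(candidates, target, top_k=2)
```

Correlation captures only linear, single-feature relationships. A production system would typically combine several scoring criteria.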

dotData supports all state-of-the-art Machine Learning libraries and provides Machine Learning optimization features like an automated hyper-parameter search of ML algorithms, the ability to select promising features for machine learning, and the automatic selection of “champion” models.
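
The hyper-parameter search and "champion" selection described above can be sketched in a few lines of plain Python. This is an assumption-laden toy, not dotData's search procedure: it fits a one-variable ridge regression for each value in a small grid of penalties, scores every trial on held-out data, and keeps the trial with the lowest validation error. The data, the penalty grid, and the function names are hypothetical.

```python
def fit_ridge(xs, ys, penalty):
    """Fit y ~ w * x (no intercept) with an L2 penalty; returns the weight w."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + penalty)


def mean_squared_error(weight, xs, ys):
    """Average squared difference between predictions and observed values."""
    return sum((weight * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def select_champion(train, valid, penalties):
    """Try every penalty, score on held-out data, and keep the best trial."""
    trials = []
    for penalty in penalties:
        weight = fit_ridge(*train, penalty)
        trials.append({
            "penalty": penalty,
            "weight": weight,
            "valid_mse": mean_squared_error(weight, *valid),
        })
    champion = min(trials, key=lambda trial: trial["valid_mse"])
    return champion, trials


# Hypothetical data: the true relation is y = 2x, but the training
# targets are slightly inflated, so a small penalty helps on validation.
train = ([1.0, 2.0, 3.0, 4.0], [2.3, 4.2, 6.5, 8.4])
valid = ([5.0, 6.0], [10.0, 12.0])

champion, trials = select_champion(train, valid, penalties=[0.0, 1.0, 10.0, 100.0])
```

The same loop generalizes to several algorithms and larger search spaces. Only the list of trials grows.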

dotData goes beyond just automation with support for a wide variety of state-of-the-art machine learning libraries, such as scikit-learn, XGBoost, LightGBM, TensorFlow, and PyTorch.

All dotData models can be deployed immediately using an innovative API approach. Create production-ready data and feature pipelines via APIs and create a production-ready ML scoring pipeline. It is possible to validate model accuracy automatically and retrain models based on real-world data using API-based integration. dotData makes deploying and maintaining AI models fast, simple, and cost-effective, giving you the best ROI and the fastest time to market possible.
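
The validate-then-retrain loop mentioned above might look like the following sketch. It does not call any dotData API. The threshold model, the `retrain` routine, the sample records, and the 0.8 accuracy floor are hypothetical placeholders for whatever a real scoring pipeline and retraining endpoint would provide.

```python
def accuracy(model, records):
    """Share of records whose predicted label matches the observed label."""
    hits = sum(1 for features, label in records if model(features) == label)
    return hits / len(records)


def validate_and_retrain(model, fresh_records, retrain, min_accuracy=0.8):
    """Keep the model if it is still accurate enough, otherwise retrain it."""
    score = accuracy(model, fresh_records)
    if score >= min_accuracy:
        return model, score, False
    return retrain(fresh_records), score, True


# Hypothetical stand-ins for a deployed model and a retraining routine.
def make_threshold_model(threshold):
    """Predict 1 when the input amount reaches the threshold, else 0."""
    return lambda amount: int(amount >= threshold)


def retrain(records):
    """Place the threshold midway between the two class means."""
    positives = [x for x, label in records if label == 1]
    negatives = [x for x, label in records if label == 0]
    midpoint = (sum(positives) / len(positives) + sum(negatives) / len(negatives)) / 2
    return make_threshold_model(midpoint)


deployed = make_threshold_model(50.0)
# Fresh labeled data in which the decision boundary has drifted downward.
fresh = [(10.0, 0), (20.0, 0), (30.0, 1), (40.0, 1), (60.0, 1)]

current, score_before, was_retrained = validate_and_retrain(deployed, fresh, retrain)
```

In practice the accuracy check would run on a schedule against newly labeled production data, and the retrained model would be validated on a separate holdout before replacing the deployed one.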

dotData AI-FastStart Program

dotData’s AI-FastStart program is an all-inclusive package of AutoML 2.0 Cloud-based software, professional services, ongoing training, and support, all designed to work in concert to help Business Intelligence professionals accelerate their AI and Machine Learning development.

AutoML & Data Science Automation Library for Python

dotDataPy is a rich and scalable Python library that enables advanced users to access dotData’s data science automation functionality – including AI-powered feature engineering and automated machine learning – with Python code.

dotDataPy can be easily integrated with Jupyter notebooks and other Python development environments, enabling users to leverage the advanced Python ecosystem fully, including rich visualization like Matplotlib and Plotly, state-of-the-art machine learning/deep learning tools like scikit-learn, Spark MLlib, PyTorch, and TensorFlow, as well as flexible DataFrames like pandas and PySpark.

dotDataPy’s automated machine learning conducts hundreds of trials to finely tune state-of-the-art machine learning algorithms (including proprietary ones) for the best accuracy in various optimization criteria. The fully-automated process frees up the time and resources of data scientists and gives them the freedom to produce high-quality machine learning models, thus enabling teams to execute more data science projects than ever before.

Deployment

The dotData platform supports on-premise Linux servers and a scalable Hadoop infrastructure. You can deploy dotData on your physical hardware to achieve the best performance while maintaining strict controls over physical and logical access to ensure data protection, all while keeping your budget in check.

dotData’s engine can scale to hundreds or thousands of servers to automatically build sophisticated machine learning models for your organization. Regardless of the size and breadth of your data set, dotData scales to match your needs. Manage the speed and efficiency of your data modeling by adding or removing computing resources.

dotData’s platform provides enterprise-grade SSL protection for Hadoop cluster communication. AES encryption support allows stored user data to remain protected, while full HTTPS support means that communication between the dotData server and user browsers is fully encrypted. dotData also provides full Kerberos-based HDFS authentication support.

dotData is easily installed as a service on your Hadoop clusters and performs distributed automated feature generation and machine learning model scoring on HDFS. dotData leverages Spark for large-scale data processing and adheres to all Hadoop management policies.

dotData can be easily deployed on Amazon Web Services (AWS) to deliver the scale and full-cycle automation you need at a cost-effective rate. Deploy AutoML and Data Science Automation without committing to specific storage, computing, or machine learning vendors.

About dotData

dotData is focused on delivering end-to-end data science automation for the enterprise. dotData’s fully-automated data science platform speeds time to value by democratizing, operationalizing and accelerating the entire data science process, from source data through feature engineering to machine learning.

Unique to the dotData Platform is its AI-powered Feature Engineering, which eliminates the most time-consuming and labor- and skill-intensive aspects of the full data science process, freeing up data scientists for higher-value tasks and enabling data engineers and business analysts to drive data science projects.

dotData is a spin-off of NEC Corporation and is led by Dr. Ryohei Fujimaki, a world-renowned data scientist and the youngest research fellow ever appointed in the 119-year history of NEC Corporation. The dotData team is delivering new levels of speed, scale, and value in successful deployments across 10 industries, including Fortune Global 250 clients.
