Simplify Your Data Science Workflow

Data comes in all shapes and sizes, yet unlocking actionable insight efficiently requires deep knowledge of data science techniques. BIOVIA Pipeline Pilot Machine Learning and Analytics Collection provides a comprehensive set of machine learning and data modeling capabilities to streamline your data science initiatives.

Analyze data, train and retrain models, and deploy your automated solution to useful enterprise applications.

Developing machine learning solutions often requires complex software architectures and deep statistical knowledge. With BIOVIA Pipeline Pilot Analytics and Machine Learning Collection, developers and end users alike can incorporate the latest machine learning techniques to their workflows with just a few clicks. No coding required.

Key Benefits

  • Merge, join, characterize, and clean your data sets
  • Apply any of 15+ machine learning (ML) methods to your scientific and engineering data
    • Use R-based ML methods such as support vector machines, neural networks, and XGBoost without writing R scripts
    • Use Python ML libraries including scikit-learn and TensorFlow
  • Rapidly apply statistical analyses
    • Use regression and classification model evaluation viewers to assess and compare model test set performance
    • Build fast, scalable Bayesian classification models
    • Use the GFA method’s genetic algorithm for variable selection and building regression ensemble models
    • Build accurate, easy-to-use RP Forest regression and classification models
  • Curate model performance
    • Deploy model applicability domain (MAD) methods and cross-validation
    • Employ the ML framework for cross-validation, hyperparameter tuning, and variable importance assessment for any type of model
  • Work flexibly
    • Support for 3rd party statistic platforms and tools such as Jupyter Notebook, R, JMP and SAS
  • Read in discipline-specific data
    • Purpose-built to support various numerical, chemical, biological, textual, and image data types
    • Use built-in applicability domain measures and error models to assess sample-specific prediction confidence
  • Optimize predictions
    • Train multiple trial models in parallel to identify top performers or combine multiple models into a single ensemble model
  • Simplify multi-objective optimization
    • Employ methods such as Pareto optimization to multi-objective optimization problems
  • Visualize results in workflow
    • Generate interactive reports with ROC plots, enrichment plots and other visualization techniques
    • Perform exploratory analysis, including PCA, clustering, and multi-dimensional data visualization 

Start Your Journey

The world of machine learning and analytics is changing. Discover how to stay a step ahead with BIOVIA

Join the conversation in the BIOVIA Pipeline Pilot User Community!

Also Discover

Biology Collections
Actionable Insights from Biology Data
Chemistry Collections
A Comprehensive Suite of Applications for Chemistry Research
Imaging Collection
Advanced Image and Video Analytics with ML and Deep Learning
Document Search & Analysis Collections
Scientifically-aware Text Mining Capabilities
Laboratory Analytics Collections
Advancing Beyond a Digitalized Lab
Scientific Informatics
Transform Scientific Data into Knowledge
BIOVIA Pipeline Pilot
Accelerate Innovation in Science and Engineering with AI and Machine Learning

Learn What BIOVIA Can Do for You

Speak with a BIOVIA expert to learn how our solutions enable seamless collaboration and sustainable innovation at organizations of every size.

Get Started

Courses and classes are available for students, academia, professionals and companies. Find the right BIOVIA training for you. 

Get Help

Find information on software & hardware certification, software downloads, user documentation, support contact and services offering