Uploaded image for project: 'Machine Learning Library'
  1. Machine Learning Library
  2. ML-458

Implement a Preprocessing Bundle for HPCC Systems Machine Learning Library

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Not specified
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:

      Description

      Produce a pre-processing bundle as part of the HPCC System Machine Learning Library that assists the user in performing some of the basic tasks of preparing their data for use with various ML Algorithms. The aim is to create tools in ECL to be added to the HPCC Systems machine learning library in the form of bundle to prepare data. Some examples of tools to be added are

      • One-hot encoding/decoding
      • Variable normalization and standardization
      • Scaling
      • Various sampling methods
      • Other important pre-processing tasks identified during the course of the project

      The project is open to accepting other suggested tools that users of the HPCC Systems ML library may find useful.

      Completion of this project involves:

      • Implementation of proposed pre-processing tools in ECL
      • Unit Testing
      • Code check in on Github
      • Documentation
      • White Paper

        Attachments

          Activity

            People

            • Assignee:
              vzeufack Vannel Zeufack
              Reporter:
              lorraineachapman Lorraine Chapman
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated: