# What are the right algorithms for this open loop control problem

Artificial Intelligence Asked by toben aus on August 30, 2020

i’m quiete new to AI, i have to choose the right algorithm for a machine learning problem. Since there are so many methods, algorithms and techniques, i’m not sure whether i’m on the right track.

The problem we want to tackle with machine learning goes something like this:
For each process we execute we get a certain performance. In each process our tool wears, which has an (unknown) effect on our performance. We also can adjust five parameters of our machine, which also have an (unknown) effect on our performance. we want to adjust the five parameters of our machine in a way, that performance is constant, even though tool wear is happening.

For learning, we would execute our process with different wear and machine parameter configurations and see how performance reacts. We will be able to execute quiete a big amount of experiments (500-1000)

To begin with, we assume that there are 20 different tool wear states (from sharp till dull), we have five parameters of the machine we can freely adjust.

To my understanding so far, this is an open loop control problem. We could use reinforcement learning to learn the relation of tool wear and machine parameters for a certain desired machine performance.

My questions:
Is reinforcement learning the correct method?
Which algorithm is used to identify the next machine parameter set for the next experiment in learning phase?
Which learning algorithm should i choose?

Any hints, tutorials, correct methods, links which help would be appreciated.

thanks+regards

although reinforcement learning has been used in control, it is not its most successful story, especially so for continuous control domains

First of all you must figure out the behavior of your system, that is how your "performance" adjusts with changing system parameters and your tool wear state. This is a supervised learning task, once you have the experiments in place, and can be tackled e.g. with a neural net

Answered by nikos on August 30, 2020

## Related Questions

### Classification or regression for deep Q learning

0  Asked on December 16, 2021

### Is the Bellman equation that uses sampling weighted by the Q values (instead of max) a contraction?

0  Asked on December 16, 2021 by sirfroggy

### Why does reinforcement learning using a non-linear function approximator diverge when using strongly correlated data as input?

1  Asked on December 13, 2021

### How Graph Convolutional Neural Networks forward propagate?

1  Asked on December 13, 2021

### In which cases is the categorical cross-entropy better than the mean squared error?

3  Asked on December 11, 2021

### What are the keys and values of the attention model for the encoder and decoder in the “Attention Is All You Need” paper?

1  Asked on December 11, 2021

### Is my 57% sports betting accuracy correct?

1  Asked on December 11, 2021 by sports_stats

### Understanding the “unroling” step in the proof of the policy gradient theorem

2  Asked on December 9, 2021

### Forcing a neural network to be close to a previous model – Regularization through given model

0  Asked on December 9, 2021 by blba

### Why is DDPG not learning and it does not converge?

0  Asked on December 9, 2021 by i_al-thamary

### How artificial intelligence will change the future?

1  Asked on December 7, 2021

### Can residual neural networks use other activation functions different from ReLU?

1  Asked on December 7, 2021 by jr123456jr987654321

### Is it necessary to standardise the expected output

1  Asked on December 7, 2021

### Is CNN capable of extracting the descriptive statistics features

1  Asked on December 4, 2021 by nilsinelabore

### How to create Partially Connected NNs with prespecified connections using Tensorflow?

3  Asked on December 2, 2021 by pnar-demetci

### What is the best resources to learn Graph Convolutional Neural Networks?

2  Asked on December 2, 2021

### Is it possible to use AI to reverse engineer software?

2  Asked on November 29, 2021 by ipsumpanest

### Why do CNN’s sometimes make highly confident mistakes, and how can one combat this problem?

6  Asked on November 29, 2021

### Can you explain me this CNN architecture?

1  Asked on November 29, 2021 by sanmu

### In Deep Deterministic Policy Gradient, are all weights of the policy network updated with the same or different value?

1  Asked on November 29, 2021 by unter_983