0

I am exploring the idea of the ensemble technique for the semantic segmentation model. I initially wanted to use a support vector machine combined with UNet/ResNet/DeepLabV3 for the last layer. I found that I could use 'hinge loss' as a loss function and it works the same as a support vector machine. But I'm a bit not clear how this will work and if this is the right way I'm trying to implement it. Since I do not have extensive knowledge in computer vision I might be completely wrong and digging the wrong hole. If that's the case please let me know.

What I was thinking is the outcome of Unet layer will be imagesize*imagesize with probabilities of class for either 0 or 1(before applying activation) and I wanted to apply a support vector machine with this all pixel vectors and then classify with the result.

my question will be

  1. I feel like using sigmoid as activation of last layer and selecting hinge loss for loss function is a little different to what I wanted to do. Is this correct?
  2. Can I use hinge loss for semantic segmentation(binary class) as well?
  3. If the aforementioned methods are all wrong and If I wanted to implement the ensemble technique(semantic segmentation model+SVM) how can I implement?
askedJul 7, 2022 at 14:51
dspeanut's user avatar
1
  • Please edit the question to limit it to a specific problem with enough detail to identify an adequate answer.CommentedJul 20, 2022 at 21:16

1 Answer1

0

I feel like using sigmoid as activation of last layer and selectinghinge loss for loss function is a little different to what I wanted todo. Is this correct?

Sigmoid makes sense as your last layer as you are working with a binary classification problem and this activation will squeeze the values between 0 and 1. For binary classification problems the default loss functions is binary cross entropy (BCE). Hinge is a possible alternative, but for the problems I have been working with it has very seldomly led to better results.

Can I use hinge loss for semantic segmentation(binary class) as well?

To be completely sure which loss function you should use for your specific problem you should train both with BCE and Hinge. However, I would start by BCE.

If the aforementioned methods are all wrong and If I wanted toimplement the ensemble technique(semantic segmentation model+SVM) howcan I implement?

Go for a pure neural network approach. You will get much better results

answeredJul 25, 2022 at 7:06
Mike B's user avatar
Sign up to request clarification or add additional context in comments.

1 Comment

my intention was to experiment of different combinations actually. but in the end I decided to take a different approach like you said this won't change much of the performance result. Thanks very much for the comment

Your Answer

Sign up orlog in

Sign up using Google
Sign up using Email and Password

Post as a guest

Required, but never shown

By clicking “Post Your Answer”, you agree to ourterms of service and acknowledge you have read ourprivacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.