Ingredient-Level Food Image Segmentation for Nutrition Awareness

arXiv:2606.24059v1 Announce Type: new Abstract: Food images often contain several visible ingredients, so assigning one dish label to an entire image hides important visual structure. This work studies ingredient-level semantic segmentation on FoodSeg103, where the model predicts an ingredient class for each pixel. Two SegFormer variants were fine-tuned and evaluated under a controlled setup: SegFormer-B0 as the smaller baseline model and SegFormer-B1 as the larger final model. Both models use I...

arXiv cs.CV ·Jonesh Shrestha ·
compartilhar: