Integrating Human Vision Perception in Vision Transformers for Classifying Waste Items
In this paper, we propose an novel methodology aimed at simulating the learning phenomenon of nystagmus through the application of differential blurring on datasets. Nystagmus is a biological phenomenon that influences human vision throughout life, notably by diminishing head shake from infancy to a...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | In this paper, we propose an novel methodology aimed at simulating the
learning phenomenon of nystagmus through the application of differential
blurring on datasets. Nystagmus is a biological phenomenon that influences
human vision throughout life, notably by diminishing head shake from infancy to
adulthood. Leveraging this concept, we address the issue of waste
classification, a pressing global concern. The proposed framework comprises two
modules, with the second module closely resembling the original Vision
Transformer, a state-of-the-art model model in classification tasks. The
primary motivation behind our approach is to enhance the model's precision and
adaptability, mirroring the real-world conditions that the human visual system
undergoes. This novel methodology surpasses the standard Vision Transformer
model in waste classification tasks, exhibiting an improvement with a margin of
2%. This improvement underscores the potential of our methodology in improving
model precision by drawing inspiration from human vision perception. Further
research in the proposed methodology could yield greater performance results,
and can be extrapolated to other global issues. |
---|---|
DOI: | 10.48550/arxiv.2312.12143 |