Xdecoder 10.5 __link__ [UHD]
X-Decoder 10.5 features an improved alignment between visual features and linguistic embeddings. This allows the model to perform "open-vocabulary" tasks with higher precision, meaning it can identify and segment objects it has never explicitly seen during supervised training, provided it understands the textual description.
