New submissions for Mon, 8 Aug 22

Keyword: out of distribution detection

There is no result

Keyword: out-of-distribution detection

Task-agnostic Continual Hippocampus Segmentation for Smooth Population Shifts

Authors: Camila Gonzalez, Amin Ranem, Ahmed Othman, Anirban Mukhopadhyay
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
Arxiv link: https://arxiv.org/abs/2208.03206
Pdf link: https://arxiv.org/pdf/2208.03206
Abstract Most continual learning methods are validated in settings where task boundaries are clearly defined and task identity information is available during training and testing. We explore how such methods perform in a task-agnostic setting that more closely resembles dynamic clinical environments with gradual population shifts. We propose ODEx, a holistic solution that combines out-of-distribution detection with continual learning techniques. Validation on two scenarios of hippocampus segmentation shows that our proposed method reliably maintains performance on earlier tasks without losing plasticity.

Keyword: expected calibration error

There is no result

Keyword: overconfident

There is no result

Keyword: overconfidence

There is no result

Keyword: confidence

Memory-Guided Collaborative Attention for Nighttime Thermal Infrared Image Colorization
Authors: Fu-Ya Luo, Yi-Jun Cao, Kai-Fu Yang, Yong-Jie Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Arxiv link: https://arxiv.org/abs/2208.02960
Pdf link: https://arxiv.org/pdf/2208.02960
Abstract Nighttime thermal infrared (NTIR) image colorization, also known as translation of NTIR images into daytime color images (NTIR2DC), is a promising research direction to facilitate nighttime scene perception for humans and intelligent systems under unfavorable conditions (e.g., complete darkness). However, previously developed methods have poor colorization performance for small sample classes. Moreover, reducing the high confidence noise in pseudo-labels and addressing the problem of image gradient disappearance during translation are still under-explored, and keeping edges from being distorted during translation is also challenging. To address the aforementioned issues, we propose a novel learning framework called Memory-guided cOllaboRative atteNtion Generative Adversarial Network (MornGAN), which is inspired by the analogical reasoning mechanisms of humans. Specifically, a memory-guided sample selection strategy and adaptive collaborative attention loss are devised to enhance the semantic preservation of small sample categories. In addition, we propose an online semantic distillation module to mine and refine the pseudo-labels of NTIR images. Further, conditional gradient repair loss is introduced for reducing edge distortion during translation. Extensive experiments on the NTIR2DC task show that the proposed MornGAN significantly outperforms other image-to-image translation methods in terms of semantic preservation and edge consistency, which helps improve the object detection accuracy remarkably.

Human Saliency-Driven Patch-based Matching for Interpretable Post-mortem Iris Recognition
Authors: Aidan Boyd, Daniel Moreira, Andrey Kuehlkamp, Kevin Bowyer, Adam Czajka
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Arxiv link: https://arxiv.org/abs/2208.03138
Pdf link: https://arxiv.org/pdf/2208.03138
Abstract Forensic iris recognition, as opposed to live iris recognition, is an emerging research area that leverages the discriminative power of iris biometrics to aid human examiners in their efforts to identify deceased persons. As a machine learning-based technique in a predominantly human-controlled task, forensic recognition serves as "back-up" to human expertise in the task of post-mortem identification. As such, the machine learning model must be (a) interpretable, and (b) post-mortem-specific, to account for changes in decaying eye tissue. In this work, we propose a method that satisfies both requirements, and that approaches the creation of a post-mortem-specific feature extractor in a novel way employing human perception. We first train a deep learning-based feature detector on post-mortem iris images, using annotations of image regions highlighted by humans as salient for their decision making. In effect, the method learns interpretable features directly from humans, rather than purely data-driven features. Second, regional iris codes (again, with human-driven filtering kernels) are used to pair detected iris patches, which are translated into pairwise, patch-based comparison scores. In this way, our method presents human examiners with human-understandable visual cues in order to justify the identification decision and corresponding confidence score. When tested on a dataset of post-mortem iris images collected from 259 deceased subjects, the proposed method places among the three best iris matchers, demonstrating better results than the commercial (non-human-interpretable) VeriEye approach. We propose a unique post-mortem iris recognition method trained with human saliency to give fully-interpretable comparison outcomes for use in the context of forensic examination, achieving state-of-the-art recognition performance.

Keyword: scaling

Latent Multi-Relation Reasoning for GAN-Prior based Image Super-Resolution
Authors: Jiahui Zhang, Fangneng Zhan, Yingchen Yu, Rongliang Wu, Xiaoqin Zhang, Shijian Lu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Arxiv link: https://arxiv.org/abs/2208.02861
Pdf link: https://arxiv.org/pdf/2208.02861
Abstract Recently, single image super-resolution (SR) under large scaling factors has witnessed impressive progress by introducing pre-trained generative adversarial networks (GANs) as priors. However, most GAN-Priors based SR methods are constrained by an attribute disentanglement problem in inverted latent codes which directly leads to mismatches of visual attributes in the generator layers and further degraded reconstruction. In addition, stochastic noises fed to the generator are employed for unconditional detail generation, which tends to produce unfaithful details that compromise the fidelity of the generated SR image. We design LAREN, a LAtent multi-Relation rEasoNing technique that achieves superb large-factor SR through graph-based multi-relation reasoning in latent space. LAREN consists of two innovative designs. The first is graph-based disentanglement that constructs a superior disentangled latent space via hierarchical multi-relation reasoning. The second is graph-based code generation that produces image-specific codes progressively via recursive relation reasoning which enables prior GANs to generate desirable image details. Extensive experiments show that LAREN achieves superior large-factor image SR and outperforms the state-of-the-art consistently across multiple benchmarks.

Analytical disk-cylinder interaction potential laws for the computational modeling of adhesive, deformable (nano)fibers
Authors: Maximilian J. Grill, Wolfgang A. Wall, Christoph Meier
Subjects: Computational Engineering, Finance, and Science (cs.CE)
Arxiv link: https://arxiv.org/abs/2208.03074
Pdf link: https://arxiv.org/pdf/2208.03074
Abstract The analysis of complex fibrous systems or materials on the micro- and nanoscale, which have a high practical relevance for many technical or biological systems, requires accurate analytical descriptions of the adhesive and repulsive forces acting on the fiber surfaces. While such analytical expressions are generally needed both for theoretical studies and for computer-based simulations, the latter motivates us here to derive disk-cylinder interaction potential laws that are valid for arbitrary mutual orientations in the decisive regime of small surface separations. The chosen type of fundamental point-pair interaction follows the simple Lennard-Jones model with inverse power laws for both the adhesive van der Waals part and the steric, repulsive part. We present three different solutions, ranging from highest accuracy to the best trade-off between simplicity of the expression and sufficient accuracy for our intended use. The validity of simplifying approximations and the accuracy of the derived potential laws is thoroughly analyzed, using both numerical and analytical reference solutions for specific interaction cases. Most importantly, the correct asymptotic scaling behavior in the decisive regime of small separations is achieved, and also the theoretically predicted $(1/\sin\alpha)$-angle dependence (for non-parallel cylinders) is obtained by the proposed analytical solutions. As we show in the outlook to our current research, the derived analytical disk-cylinder interaction potential laws may be used to formulate highly efficient computational models for the interaction of arbitrarily curved fibers, such that the disk represents the cross-section of the first and the cylinder a local approximation to the shape of the second fiber.
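
For reference, the point-pair law named in the abstract above can be sketched as a sum of two inverse power laws. The 12-6 exponents and the parameter names `epsilon` and `sigma` are the common textbook choice and are an assumption here; the abstract does not state which exponents or constants the authors use, and this is not their disk-cylinder potential.

```python
def lennard_jones(r, epsilon=1.0, sigma=1.0):
    """Textbook 12-6 Lennard-Jones point-pair potential (assumed form).

    r       -- separation between the two points, must be positive
    epsilon -- depth of the potential well (hypothetical default)
    sigma   -- separation at which the potential crosses zero (hypothetical default)
    """
    if r <= 0:
        raise ValueError("separation r must be positive")
    sr6 = (sigma / r) ** 6
    # Steric repulsion scales as r^-12, van der Waals adhesion as r^-6.
    return 4.0 * epsilon * (sr6 ** 2 - sr6)
```

In this form the potential is zero at `r = sigma` and reaches its minimum value `-epsilon` at `r = 2 ** (1 / 6) * sigma`.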

Asymptotically consistent and computationally efficient modeling of short-ranged molecular interactions between curved slender fibers undergoing large 3D deformations
Authors: Maximilian J. Grill, Wolfgang A. Wall, Christoph Meier
Subjects: Computational Engineering, Finance, and Science (cs.CE)
Arxiv link: https://arxiv.org/abs/2208.03149
Pdf link: https://arxiv.org/pdf/2208.03149
Abstract This article proposes a novel computational modeling approach for short-ranged molecular interactions between curved slender fibers undergoing large 3D deformations, and gives a detailed overview how it fits into the framework of existing fiber or beam interaction models, either considering microscale molecular or macroscale contact effects. The direct evaluation of a molecular interaction potential between two general bodies in 3D space would require to integrate molecule densities over two 3D volumes, leading to a sixfold integral to be solved numerically. By exploiting the short-range nature of the considered class of interaction potentials as well as the fundamental kinematic assumption of undeformable fiber cross-sections, as typically applied in mechanical beam theories, a recently derived, closed-form analytical solution is applied for the interaction potential between a given section of the first fiber (slave beam) and the entire second fiber (master beam). This novel approach based on a pre-defined section-beam interaction potential (SBIP) requires only one single integration step along the slave beam length to be performed numerically. In terms of accuracy, the total beam-beam interaction potential resulting from this approach is shown to exhibit an asymptotically consistent angular and distance scaling behavior. In addition to elementary two-fiber systems, carefully chosen to verify accuracy and asymptotic consistence of the proposed SBIP approach, a potential practical application in form of adhesive nanofiber-grafted surfaces is studied. Involving a large number of helicoidal fibers undergoing large 3D deformations, arbitrary mutual fiber orientations as well as frequent local fiber pull-off and snap-into-contact events, this example demonstrates the robustness and computational efficiency of the new approach.

Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks
Authors: Bernd Prach, Christoph H. Lampert
Subjects: Machine Learning (cs.LG)
Arxiv link: https://arxiv.org/abs/2208.03160
Pdf link: https://arxiv.org/pdf/2208.03160
Abstract It is a highly desirable property for deep networks to be robust against small input changes. One popular way to achieve this property is by designing networks with a small Lipschitz constant. In this work, we propose a new technique for constructing such Lipschitz networks that has a number of desirable properties: it can be applied to any linear network layer (fully-connected or convolutional), it provides formal guarantees on the Lipschitz constant, it is easy to implement and efficient to run, and it can be combined with any training objective and optimization method. In fact, our technique is the first one in the literature that achieves all of these properties simultaneously. Our main contribution is a rescaling-based weight matrix parametrization that guarantees each network layer to have a Lipschitz constant of at most 1 and results in the learned weight matrices to be close to orthogonal. Hence we call such layers almost-orthogonal Lipschitz (AOL). Experiments and ablation studies in the context of image classification with certified robust accuracy confirm that AOL layers achieve results that are on par with most existing methods. Yet, they are simpler to implement and more broadly applicable, because they do not require computationally expensive matrix orthogonalization or inversion steps as part of the network architecture. We provide code at https://github.com/berndprach/AOL.
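
To make the idea of a rescaling-based parametrization concrete, here is a minimal pure-Python sketch. The abstract does not spell out the formula; the sketch assumes the column rescaling W = P D with d_j = (sum_k |P^T P|_jk)^(-1/2), which bounds the spectral norm of W by 1 for any P. Whether this matches the authors' implementation in every detail is an assumption, and the function names are hypothetical; the linked repository is the authoritative source.

```python
import math

def aol_rescale(P):
    """Return W = P D with d_j = (sum_k |P^T P|_jk)^(-1/2) (assumed form).

    P is a matrix given as a list of rows. All-zero columns stay zero.
    """
    n_rows, n_cols = len(P), len(P[0])
    # Gram matrix G = P^T P, shape (n_cols, n_cols).
    gram = [[sum(P[r][i] * P[r][j] for r in range(n_rows))
             for j in range(n_cols)] for i in range(n_cols)]
    d = []
    for i in range(n_cols):
        row_sum = sum(abs(g) for g in gram[i])
        d.append(1.0 / math.sqrt(row_sum) if row_sum > 0 else 0.0)
    # Scale column j of P by d[j].
    return [[P[r][j] * d[j] for j in range(n_cols)] for r in range(n_rows)]

def matvec(W, x):
    """Matrix-vector product for a list-of-rows matrix."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def norm(v):
    """Euclidean norm of a list of floats."""
    return math.sqrt(sum(vi * vi for vi in v))

# Arbitrary example weights, chosen only for illustration.
P = [[3.0, -1.0, 2.0],
     [0.5, 4.0, -2.5],
     [1.0, 1.0, 1.0],
     [-2.0, 0.0, 6.0]]
W = aol_rescale(P)
x = [1.0, -2.0, 0.5]
# A 1-Lipschitz linear layer never stretches its input: ||W x|| <= ||x||.
assert norm(matvec(W, x)) <= norm(x) + 1e-12
```

If the columns of `P` are already orthonormal, `P^T P` is the identity, every `d_j` equals 1, and the rescaling leaves `P` unchanged, which is consistent with the "almost-orthogonal" name.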

Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
Authors: Margaret Li, Suchin Gururangan, Tim Dettmers, Mike Lewis, Tim Althoff, Noah A. Smith, Luke Zettlemoyer
Subjects: Computation and Language (cs.CL)
Arxiv link: https://arxiv.org/abs/2208.03306
Pdf link: https://arxiv.org/pdf/2208.03306
Abstract We present Branch-Train-Merge (BTM), a communication-efficient algorithm for embarrassingly parallel training of large language models (LLMs). We show it is possible to independently train subparts of a new class of LLMs on different subsets of the data, eliminating the massive multi-node synchronization currently required to train LLMs. BTM learns a set of independent expert LMs (ELMs), each specialized to a different textual domain, such as scientific or legal text. These ELMs can be added and removed to update data coverage, ensembled to generalize to new domains, or averaged to collapse back to a single LM for efficient inference. New ELMs are learned by branching from (mixtures of) ELMs in the current set, further training the parameters on data for the new domain, and then merging the resulting model back into the set for future use. Experiments show that BTM improves in- and out-of-domain perplexities as compared to GPT-style Transformer LMs, when controlling for training cost. Through extensive analysis, we show that these results are robust to different ELM initialization schemes, but require expert domain specialization; LM ensembles with random data splits do not perform well. We also present a study of scaling BTM into a new corpus of 64 domains (192B whitespace-separated tokens in total); the resulting LM (22.4B total parameters) performs as well as a Transformer LM trained with 2.5 times more compute. These gains grow with the number of domains, suggesting more aggressive parallelism could be used to efficiently train larger models in future work.
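
The two ways of combining expert LMs mentioned in the abstract above, ensembling and parameter averaging, can be illustrated with a small sketch. Everything here is hypothetical: the parameter dictionaries, the function names, and the mixing weights are placeholders, since the abstract does not say how the weights are chosen or how the authors implement either step.

```python
def average_parameters(experts, weights):
    """Weighted average of expert parameters (collapse to a single model).

    experts -- list of dicts mapping a parameter name to a list of floats;
               all experts are assumed to share names and shapes
    weights -- one non-negative mixing weight per expert (hypothetical)
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    merged = {}
    for name in experts[0]:
        size = len(experts[0][name])
        merged[name] = [
            sum(w * e[name][k] for w, e in zip(weights, experts)) / total
            for k in range(size)
        ]
    return merged

def ensemble_probs(expert_probs, weights):
    """Weighted mixture of the experts' next-token distributions.

    expert_probs -- list of probability vectors over the same vocabulary
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    vocab = len(expert_probs[0])
    return [
        sum(w * p[v] for w, p in zip(weights, expert_probs)) / total
        for v in range(vocab)
    ]
```

Averaging yields one model with the cost of a single forward pass, while ensembling runs every expert and mixes their outputs, which is the trade-off the abstract alludes to with "efficient inference".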

Keyword: calibration

Improved post-hoc probability calibration for out-of-domain MRI segmentation
Authors: Cheng Ouyang, Shuo Wang, Chen Chen, Zeju Li, Wenjia Bai, Bernhard Kainz, Daniel Rueckert
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Arxiv link: https://arxiv.org/abs/2208.02870
Pdf link: https://arxiv.org/pdf/2208.02870
Abstract Probability calibration for deep models is highly desirable in safety-critical applications such as medical imaging. It makes output probabilities of deep networks interpretable, by aligning prediction probabilities with the actual accuracy in test data. In image segmentation, well-calibrated probabilities allow radiologists to identify regions where model-predicted segmentations are unreliable. These unreliable predictions often occur to out-of-domain (OOD) images that are caused by imaging artifacts or unseen imaging protocols. Unfortunately, most previous calibration methods for image segmentation perform sub-optimally on OOD images. To reduce the calibration error when confronted with OOD images, we propose a novel post-hoc calibration model. Our model leverages the pixel susceptibility against perturbations at the local level, and the shape prior information at the global level. The model is tested on cardiac MRI segmentation datasets that contain unseen imaging artifacts and images from an unseen imaging protocol. We demonstrate reduced calibration errors compared with the state-of-the-art calibration algorithm.
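
The notion of "aligning prediction probabilities with the actual accuracy" is commonly quantified with a binned expected calibration error (ECE). The sketch below is a generic version of that metric, not the calibration model proposed in the paper; the equal-width binning, the default of 10 bins, and the function name are assumptions.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned ECE: sample-weighted mean of |accuracy - mean confidence|.

    confidences -- predicted probability of the chosen class, in [0, 1]
    correct     -- booleans, True where the prediction was right
    n_bins      -- number of equal-width confidence bins (assumed default)
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have equal length")
    n = len(confidences)
    if n == 0:
        raise ValueError("need at least one prediction")
    bins = [[] for _ in range(n_bins)]
    for c, ok in zip(confidences, correct):
        # A confidence of exactly 1.0 is placed in the last bin.
        idx = min(int(c * n_bins), n_bins - 1)
        bins[idx].append((c, ok))
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += len(members) / n * abs(accuracy - mean_conf)
    return ece
```

For segmentation, each pixel (or voxel) would contribute one confidence and one correctness flag. A model that reports 0.9 confidence on pixels where it is right only 75% of the time contributes a gap of 0.15 in that bin.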

ericbeyer / L-arxiv-interest-tracker

New submissions for Mon, 8 Aug 22 #593
