New submissions for Tue, 16 Aug 22

Keyword: out of distribution detection

There is no result

Keyword: out-of-distribution detection

There is no result

Keyword: expected calibration error

There is no result

Keyword: overconfident

There is no result

Keyword: overconfidence

There is no result

Keyword: confidence

Double Auctions with Two-sided Bandit Feedback

Authors: Soumya Basu, Abishek Sankararaman
Subjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT)
Arxiv link: https://arxiv.org/abs/2208.06536
Pdf link: https://arxiv.org/pdf/2208.06536
Abstract Double Auction enables decentralized transfer of goods between multiple buyers and sellers, thus underpinning the functioning of many online marketplaces. Buyers and sellers compete in these markets through bidding, but often do not know their own valuation a priori. As the allocation and pricing happen through bids, the profitability of participants, and hence the sustainability of such markets, depends crucially on learning respective valuations through repeated interactions. We initiate the study of Double Auction markets under bandit feedback on both the buyers' and sellers' sides. We show that with confidence bound based bidding and 'Average Pricing' there is efficient price discovery among the participants. In particular, the buyers and sellers exchanging goods attain $O(\sqrt{T})$ regret in $T$ rounds. The buyers and sellers who do not benefit from exchange in turn only experience $O(\log{T}/ \Delta)$ regret in $T$ rounds where $\Delta$ is the minimum price gap. We augment our upper bound by showing that even with a known fixed price of the good -- a simpler learning problem than Double Auction -- $\omega(\sqrt{T})$ regret is unattainable in certain markets.
Confidence Matters: Inspecting Backdoors in Deep Neural Networks via Distribution Transfer
Authors: Tong Wang, Yuan Yao, Feng Xu, Miao Xu, Shengwei An, Ting Wang
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
Arxiv link: https://arxiv.org/abs/2208.06592
Pdf link: https://arxiv.org/pdf/2208.06592
Abstract Backdoor attacks have been shown to be a serious security threat against deep learning models, and detecting whether a given model has been backdoored becomes a crucial task. Existing defenses are mainly built upon the observation that the backdoor trigger is usually of small size or affects the activation of only a few neurons. However, the above observations are violated in many cases especially for advanced backdoor attacks, hindering the performance and applicability of the existing defenses. In this paper, we propose a backdoor defense DTInspector built upon a new observation. That is, an effective backdoor attack usually requires high prediction confidence on the poisoned training samples, so as to ensure that the trained model exhibits the targeted behavior with a high probability. Based on this observation, DTInspector first learns a patch that could change the predictions of most high-confidence data, and then decides the existence of backdoor by checking the ratio of prediction changes after applying the learned patch on the low-confidence data. Extensive evaluations on five backdoor attacks, four datasets, and three advanced attacking types demonstrate the effectiveness of the proposed defense.
Multi-Attribute Open Set Recognition
Authors: Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Claudia Blaiotta, Mauricio Munoz, Volker Fischer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Arxiv link: https://arxiv.org/abs/2208.06809
Pdf link: https://arxiv.org/pdf/2208.06809
Abstract Open Set Recognition (OSR) extends image classification to an open-world setting, by simultaneously classifying known classes and identifying unknown ones. While conventional OSR approaches can detect Out-of-Distribution (OOD) samples, they cannot provide explanations indicating which underlying visual attribute(s) (e.g., shape, color or background) cause a specific sample to be unknown. In this work, we introduce a novel problem setup that generalizes conventional OSR to a multi-attribute setting, where multiple visual attributes are simultaneously recognized. Here, OOD samples can be not only identified but also categorized by their unknown attribute(s). We propose simple extensions of common OSR baselines to handle this novel scenario. We show that these baselines are vulnerable to shortcuts when spurious correlations exist in the training dataset. This leads to poor OOD performance which, according to our experiments, is mainly due to unintended cross-attribute correlations of the predicted confidence scores. We provide empirical evidence showing that this behavior is consistent across different baselines on both synthetic and real-world datasets.
Confidence-Guided Learning Process for Continuous Classification of Time Series
Authors: Chenxi Sun, Moxian Song, Derun Can, Baofeng Zhang, Shenda Hong, Hongyan Li
Subjects: Machine Learning (cs.LG)
Arxiv link: https://arxiv.org/abs/2208.06883
Pdf link: https://arxiv.org/pdf/2208.06883
Abstract In the real world, the class of a time series is usually labeled at the final time, but many applications require classifying time series at every time point. For example, the outcome of a critical patient is only determined at the end, but the patient should be diagnosed at all times for timely treatment. Thus, we propose a new concept: Continuous Classification of Time Series (CCTS). It requires the model to learn data in different time stages. But the time series evolves dynamically, leading to different data distributions. When a model learns multiple distributions, it always forgets or overfits. We suggest that meaningful learning scheduling has potential due to an interesting observation: measured by confidence, the process of a model learning multiple distributions is similar to the process of a human learning multiple kinds of knowledge. Thus, we propose a novel Confidence-guided method for CCTS (C3TS). It can imitate the alternating human confidence described by the Dunning-Kruger Effect. We define the objective-confidence to arrange data, and the self-confidence to control the learning duration. Experiments on four real-world datasets show that C3TS is more accurate than all baselines for CCTS.
Keyword: scaling

VegaFusion: Automatic Server-Side Scaling for Interactive Vega Visualizations
Authors: Nicolas Kruchten, Jon Mease, Dominik Moritz
Subjects: Human-Computer Interaction (cs.HC)
Arxiv link: https://arxiv.org/abs/2208.06631
Pdf link: https://arxiv.org/pdf/2208.06631
Abstract The Vega grammar has been broadly adopted by a growing ecosystem of browser-based visualization tools. However, the reference Vega renderer does not scale well to large datasets (e.g., millions of rows or hundreds of megabytes) because it requires the entire dataset to be loaded into browser memory. We introduce VegaFusion, which brings automatic server-side scaling to the Vega ecosystem. VegaFusion accepts generic Vega specifications and partitions the required computation between the client and an out-of-browser, natively-compiled server-side process. Large datasets can be processed server-side to avoid loading them into the browser and to take advantage of multi-threading, more powerful server hardware and caching. We demonstrate how VegaFusion can be integrated into the existing Vega ecosystem, and show that VegaFusion greatly outperforms the reference implementation. We demonstrate these benefits with VegaFusion running on the same machine as the client as well as on a remote machine.
On the Formalization of the Heat Conduction Problem in HOL
Authors: Elif Deniz, Adnan Rashid, Osman Hasan, Sofiène Tahar
Subjects: Logic in Computer Science (cs.LO)
Arxiv link: https://arxiv.org/abs/2208.06642
Pdf link: https://arxiv.org/pdf/2208.06642
Abstract Partial Differential Equations (PDEs) are widely used for modeling the physical phenomena and analyzing the dynamical behavior of many engineering and physical systems. The heat equation is one of the most well-known PDEs that captures the temperature distribution and diffusion of heat within a body. Due to the wider utility of these equations in various safety-critical applications, such as thermal protection systems, a formal analysis of the heat transfer is of utmost importance. In this paper, we propose to use higher-order-logic (HOL) theorem proving for formally analyzing the heat conduction problem in rectangular coordinates. In particular, we formally model the heat transfer as a one-dimensional heat equation for a rectangular slab using the multivariable calculus theories of the HOL Light theorem prover. This requires the formalization of the heat operator and formal verification of its various properties, such as linearity and scaling. Moreover, we use the separation of variables method for formally verifying the solution of the PDEs, which allows modeling the heat transfer in the slab under various initial and boundary conditions using HOL Light.
DAOS as HPC Storage, a view from Numerical Weather Prediction
Authors: Nicolau Manubens (1), Tiago Quintino (1), Simon D. Smart (1), Emanuele Danovaro (1), Adrian Jackson (2) ((1) ECMWF, (2) EPCC, The University of Edinburgh)
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
Arxiv link: https://arxiv.org/abs/2208.06752
Pdf link: https://arxiv.org/pdf/2208.06752
Abstract Novel object storage solutions potentially address long-standing scalability issues with POSIX file systems, and Storage Class Memory (SCM) offers promising performance characteristics for data-intensive use cases. Intel's Distributed Asynchronous Object Store (DAOS) is an emerging high-performance object store which can leverage SCM and NVMe devices. It has been gaining traction after scoring top positions in the I/O 500 benchmark. Numerical Weather Prediction (NWP) simulations are sensitive to I/O performance and scaling, and their output resolution and diversity are expected to increase significantly in the near future. In this work, we present a preliminary assessment of DAOS in conjunction with SCM on a research HPC system and evaluate its potential use as HPC storage at a world-leading weather forecasting centre. We demonstrate DAOS can provide the required performance, with bandwidth scaling linearly with additional SCM nodes in most cases, although choices in configuration and application design can impact achievable bandwidth. We describe a new I/O benchmark and associated metrics that address object storage performance from application-derived workloads that can be utilised to explore real-world performance for this new class of storage systems.
Energy Savings When Migrating Workloads to the Cloud
Authors: Yan Zheng, Stephan Bohacek
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI)
Arxiv link: https://arxiv.org/abs/2208.06976
Pdf link: https://arxiv.org/pdf/2208.06976
Abstract In the cloud environment, cloud service providers (CSPs) manage data centers efficiently in terms of energy consumption. Consequently, migrating workloads to clouds can result in lower energy consumption. This paper demonstrates that the Lift-and-Shift migration with optimal selections of cloud instances can provide significant energy savings, and explains how large the energy savings are and where they come from. Additionally, an analysis of the variation in energy consumption when Auto-Scaling is deployed shows that further energy savings are expected even without refactoring applications. All the conclusions and analyses are based on the real data collected by Cloudamize Inc. from May 2016 to August 2016 over 40,000 machines across approximately 300 data centers.
An Efficient and Reliable Asynchronous Federated Learning Scheme for Smart Public Transportation
Authors: Chenhao Xu, Youyang Qu, Tom H. Luan, Peter W. Eklund, Yong Xiang, Longxiang Gao
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
Arxiv link: https://arxiv.org/abs/2208.07194
Pdf link: https://arxiv.org/pdf/2208.07194
Abstract Machine Learning (ML) is a distributed approach for training predictive models on the Internet of Vehicles (IoV) to enable smart public transportation. Since the traffic conditions change over time, the ML model that predicts traffic flows and the time passengers wait at stops must be updated continuously and efficiently. Federated learning (FL) is a distributed machine learning scheme that allows vehicles to receive continuous model updates without having to upload raw data to the cloud and wait for models to be trained. However, FL in smart public transportation is vulnerable to poisoning or DDoS attacks since vehicles travel in public. Besides, due to device heterogeneity and imbalanced data distributions, the synchronized aggregation strategy that collects local models from specific vehicles before aggregation is inefficient. Although Asynchronous Federated Learning (AFL) schemes are developed to improve efficiency by aggregating local models as soon as they are received, the stale local models remain unreasonably weighted, resulting in poor learning performance. To enable smarter public transportation, this paper offers a blockchain-based asynchronous federated learning scheme with a dynamic scaling factor (DBAFL). Specifically, the novel committee-based consensus algorithm for blockchain improves reliability at the lowest possible cost of time. Meanwhile, the devised dynamic scaling factor allows AFL to assign reasonable weight to stale local models. Extensive experiments conducted on heterogeneous devices validate the superior learning performance, efficiency, and reliability of DBAFL.
Keyword: calibration

Look Both Ways: Bidirectional Visual Sensing for Automatic Multi-Camera Registration
Authors: Subodh Mishra, Sushruth Nagesh, Sagar Manglani, Graham Mills, Punarjay Chakravarty, Gaurav Pandey
Subjects: Robotics (cs.RO)
Arxiv link: https://arxiv.org/abs/2208.07362
Pdf link: https://arxiv.org/pdf/2208.07362
Abstract This work describes the automatic registration of a large network (approximately 40) of fixed, ceiling-mounted environment cameras spread over a large area (approximately 800 square meters) using a mobile calibration robot equipped with a single upward-facing fisheye camera and a backlit ArUco marker for easy detection. The fisheye camera is used to do visual odometry (VO), and the ArUco marker facilitates easy detection of the calibration robot in the environment cameras. In addition, the fisheye camera is also able to detect the environment cameras. This two-way, bidirectional detection constrains the pose of the environment cameras to solve an optimization problem. Such an approach can be used to automatically register a large-scale multi-camera system used for surveillance, automated parking, or robotic applications. This VO-based multi-camera registration method is extensively validated using real-world experiments, and also compared against a similar approach which uses a LiDAR, an expensive, heavier and power-hungry sensor.
