New submissions for Tue, 13 Sep 22

Keyword: out of distribution detection

There is no result

Keyword: out-of-distribution detection

There is no result

Keyword: expected calibration error

There is no result

Keyword: overconfident

There is no result

Keyword: overconfidence

There is no result

Keyword: confidence

Fairness in the Autobidding World with Machine-learned Advice

Authors: Yuan Deng, Negin Golrezaei, Patrick Jaillet, Jason Cheuk Nam Liang, Vahab Mirrokni
Subjects: Computer Science and Game Theory (cs.GT)
Arxiv link: https://arxiv.org/abs/2209.04748
Pdf link: https://arxiv.org/pdf/2209.04748
Abstract The increasing availability of real-time data has fueled the prevalence of algorithmic bidding (or autobidding) in online advertising markets, and has enabled online ad platforms to produce signals through machine learning techniques (i.e., ML advice) on advertisers' true perceived values for ad conversions. Previous works have studied the auction design problem while incorporating ML advice through various forms to improve total welfare of advertisers. Yet, such improvements could come at the cost of individual bidders' welfare, consequently eroding fairness of the ad platform. Motivated by this, we study how ad platforms can utilize ML advice to improve welfare guarantees and fairness on the individual bidder level in the autobidding world. We focus on a practical setting where ML advice takes the form of lower confidence bounds (or confidence intervals). We motivate a simple approach that directly sets such advice as personalized reserve prices when the platform consists of value-maximizing autobidders who are subject to return-on-ad spent (ROAS) constraints competing in multiple parallel auctions. Under parallel VCG auctions with ML advice-based reserves, we present a worst-case welfare lower-bound guarantee for individual agents, and show that platform fairness is positively correlated with ML advice quality. We also present an instance that demonstrates our welfare guarantee is tight. Further, we prove an impossibility result showing that no truthful, possibly randomized mechanism with anonymous allocations and ML advice as personalized reserves can achieve universally better fairness guarantees than VCG when coupled with ML advice of the same quality. Finally, we extend our fairness guarantees with ML advice to generalized first price (GFP) and generalized second price (GSP) auctions.
Detecting Driver Drowsiness as an Anomaly Using LSTM Autoencoders
Authors: Gülin Tüfekci, Alper Kayabaşi, Erdem Akagündüz, İlkay Ulusoy
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Arxiv link: https://arxiv.org/abs/2209.05269
Pdf link: https://arxiv.org/pdf/2209.05269
Abstract In this paper, an LSTM autoencoder-based architecture is utilized for drowsiness detection with ResNet-34 as feature extractor. The problem is considered as anomaly detection for a single subject; therefore, only the normal driving representations are learned and it is expected that drowsiness representations, yielding higher reconstruction losses, are to be distinguished according to the knowledge of the network. In our study, the confidence levels of normal and anomaly clips are investigated through the methodology of label assignment such that training performance of LSTM autoencoder and interpretation of anomalies encountered during testing are analyzed under varying confidence rates. Our method is experimented on NTHU-DDD and benchmarked with a state-of-the-art anomaly detection method for driver drowsiness. Results show that the proposed model achieves detection rate of 0.8740 area under curve (AUC) and is able to provide significant improvements on certain scenarios.

Keyword: scaling

Examining stability of machine learning methods for predicting dementia at early phases of the disease
Authors: Sinan Faouri, Mahmood AlBashayreh, Mohammad Azzeh
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Arxiv link: https://arxiv.org/abs/2209.04643
Pdf link: https://arxiv.org/pdf/2209.04643
Abstract Dementia is a neuropsychiatric brain disorder that usually occurs when one or more brain cells stop working partially or completely. Diagnosis of this disorder in the early phases of the disease is a vital task to rescue patients' lives from bad consequences and provide them with better healthcare. Machine learning methods have been proven to be accurate in predicting dementia in the early phases of the disease. The prediction of dementia depends heavily on the type of collected data, which usually are gathered from Normalized Whole Brain Volume (nWBV) and Atlas Scaling Factor (ASF), which are normally measured and corrected from Magnetic Resonance Imaging (MRI) scans. Other biological features such as age and gender can also help in the diagnosis of dementia. Although many studies use machine learning for predicting dementia, we could not reach a conclusion on which of these methods is more accurate under different experimental conditions. Therefore, this paper investigates the conclusion stability regarding the performance of machine learning algorithms for dementia prediction. To accomplish this, a large number of experiments were run using 7 machine learning algorithms and two feature reduction algorithms, namely Information Gain (IG) and Principal Component Analysis (PCA). To examine the stability of these algorithms, thresholds of feature selection were changed for the IG from 20% to 100% and the PCA dimension from 2 to 8. This has resulted in 7x9 + 7x7 = 112 experiments. In each experiment, various classification evaluation data were recorded. The obtained results show that among seven algorithms the support vector machine and Naive Bayes are the most stable algorithms while changing the selection threshold. Also, it was found that using IG would seem more efficient than using PCA for predicting dementia.
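The experiment count in the abstract above can be checked with a short sketch. The abstract gives only the IG range (20% to 100%) and the PCA range (2 to 8); the 10-point step for IG thresholds is an assumption inferred from the stated factor of 9, and the names `ig_thresholds`, `pca_dims`, and `count_experiments` are hypothetical.

```python
# Sketch of the experiment grid described in the abstract.
# Assumption: IG thresholds advance in 10-point steps, giving the 9 settings
# implied by "7x9"; the abstract itself only gives the 20%-100% range.
ig_thresholds = list(range(20, 101, 10))  # 20, 30, ..., 100 (percent)
pca_dims = list(range(2, 9))              # 2, 3, ..., 8 components
n_algorithms = 7                          # number of classifiers compared


def count_experiments(n_algos, ig_settings, pca_settings):
    """Each algorithm is run once per IG setting and once per PCA setting."""
    return n_algos * len(ig_settings) + n_algos * len(pca_settings)


total = count_experiments(n_algorithms, ig_thresholds, pca_dims)
print(total)
```

Under that step assumption, the grid has 9 IG settings and 7 PCA settings, which matches the 7x9 + 7x7 decomposition given in the abstract.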
Graph Polynomial Convolution Models for Node Classification of Non-Homophilous Graphs
Authors: Kishan Wimalawarne, Taiji Suzuki
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Arxiv link: https://arxiv.org/abs/2209.05020
Pdf link: https://arxiv.org/pdf/2209.05020
Abstract We investigate efficient learning from higher-order graph convolution and learning directly from adjacency matrices for node classification. We revisit the scaled graph residual network and remove ReLU activation from residual layers and apply a single weight matrix at each residual layer. We show that the resulting model leads to new graph convolution models as a polynomial of the normalized adjacency matrix, the residual weight matrix, and the residual scaling parameter. Additionally, we propose adaptive learning between graph polynomial convolution models and learning directly from the adjacency matrix. Furthermore, we propose fully adaptive models to learn scaling parameters at each residual layer. We show that generalization bounds of proposed methods are bounded as a polynomial of eigenvalue spectrum, scaling parameters, and upper bounds of residual weights. By theoretical analysis, we argue that the proposed models can obtain improved generalization bounds by limiting the higher-orders of convolutions and direct learning from the adjacency matrix. Using a wide set of real data, we demonstrate that the proposed methods obtain improved accuracy for node-classification of non-homophilous graphs.
Continual learning benefits from multiple sleep mechanisms: NREM, REM, and Synaptic Downscaling
Authors: Brian S. Robinson, Clare W. Lau, Alexander New, Shane M. Nichols, Erik C. Johnson, Michael Wolmetz, William G. Coon
Subjects: Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Arxiv link: https://arxiv.org/abs/2209.05245
Pdf link: https://arxiv.org/pdf/2209.05245
Abstract Learning new tasks and skills in succession without losing prior learning (i.e., catastrophic forgetting) is a computational challenge for both artificial and biological neural networks, yet artificial systems struggle to achieve parity with their biological analogues. Mammalian brains employ numerous neural operations in support of continual learning during sleep. These are ripe for artificial adaptation. Here, we investigate how modeling three distinct components of mammalian sleep together affects continual learning in artificial neural networks: (1) a veridical memory replay process observed during non-rapid eye movement (NREM) sleep; (2) a generative memory replay process linked to REM sleep; and (3) a synaptic downscaling process which has been proposed to tune signal-to-noise ratios and support neural upkeep. We find benefits from the inclusion of all three sleep components when evaluating performance on a continual learning CIFAR-100 image classification benchmark. Maximum accuracy improved during training and catastrophic forgetting was reduced during later tasks. While some catastrophic forgetting persisted over the course of network training, higher levels of synaptic downscaling lead to better retention of early tasks and further facilitated the recovery of early task accuracy during subsequent training. One key takeaway is that there is a trade-off at hand when considering the level of synaptic downscaling to use - more aggressive downscaling better protects early tasks, but less downscaling enhances the ability to learn new tasks. Intermediate levels can strike a balance with the highest overall accuracies during training. Overall, our results both provide insight into how to adapt sleep components to enhance artificial continual learning systems and highlight areas for future neuroscientific sleep research to further such systems.
Fast-Response Variable Frequency DC-DC Converters Using Switching Cycle Event-Driven Digital Control
Authors: Xiaofan Cui, Al-Thaddeus Avestruz
Subjects: Systems and Control (eess.SY)
Arxiv link: https://arxiv.org/abs/2209.05272
Pdf link: https://arxiv.org/pdf/2209.05272
Abstract This paper investigates a new method to model and control variable-frequency power converters in a switching-synchronized sampled-state space for cycle-by-cycle digital control. There are a number of significant benefits in comparison to other methods including fast dynamic performance together with ease of design and implementation. Theoretical results are presented and verified through hardware, and simulations of a current-mode buck converter with constant on-time and a current-mode boost converter with constant off-time. Dynamic voltage scaling for microprocessors and LiDAR are among the applications that can benefit.

Keyword: calibration

Multi-modal Streaming 3D Object Detection
Authors: Mazen Abdelfattah, Kaiwen Yuan, Z. Jane Wang, Rabab Ward
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Arxiv link: https://arxiv.org/abs/2209.04966
Pdf link: https://arxiv.org/pdf/2209.04966
Abstract Modern autonomous vehicles rely heavily on mechanical LiDARs for perception. Current perception methods generally require 360° point clouds, collected sequentially as the LiDAR scans the azimuth and acquires consecutive wedge-shaped slices. The acquisition latency of a full scan (~100 ms) may lead to outdated perception which is detrimental to safe operation. Recent streaming perception works proposed directly processing LiDAR slices and compensating for the narrow field of view (FoV) of a slice by reusing features from preceding slices. These works, however, are all based on a single modality and require past information which may be outdated. Meanwhile, images from high-frequency cameras can support streaming models as they provide a larger FoV compared to a LiDAR slice. However, this difference in FoV complicates sensor fusion. To address this research gap, we propose an innovative camera-LiDAR streaming 3D object detection framework that uses camera images instead of past LiDAR slices to provide an up-to-date, dense, and wide context for streaming perception. The proposed method outperforms prior streaming models on the challenging NuScenes benchmark. It also outperforms powerful full-scan detectors while being much faster. Our method is shown to be robust to missing camera images, narrow LiDAR slices, and small camera-LiDAR miscalibration.
On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation Models
Authors: Rohan Anil, Sandra Gadanho, Da Huang, Nijith Jacob, Zhuoshu Li, Dong Lin, Todd Phillips, Cristina Pop, Kevin Regan, Gil I. Shamir, Rakesh Shivanna, Qiqi Yan
Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG)
Arxiv link: https://arxiv.org/abs/2209.05310
Pdf link: https://arxiv.org/pdf/2209.05310
Abstract For industrial-scale advertising systems, prediction of ad click-through rate (CTR) is a central problem. Ad clicks constitute a significant class of user engagements and are often used as the primary signal for the usefulness of ads to users. Additionally, in cost-per-click advertising systems where advertisers are charged per click, click rate expectations feed directly into value estimation. Accordingly, CTR model development is a significant investment for most Internet advertising companies. Engineering for such problems requires many machine learning (ML) techniques suited to online learning that go well beyond traditional accuracy improvements, especially concerning efficiency, reproducibility, calibration, credit attribution. We present a case study of practical techniques deployed in Google's search ads CTR model. This paper provides an industry case study highlighting important areas of current ML research and illustrating how impactful new ML methods are evaluated and made useful in a large-scale industrial setting.
Analysis and Comparison of Classification Metrics
Authors: Luciana Ferrer
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Arxiv link: https://arxiv.org/abs/2209.05355
Pdf link: https://arxiv.org/pdf/2209.05355
Abstract A number of different performance metrics are commonly used in the machine learning literature for classification systems that output categorical decisions. Some of the most common ones are accuracy, total error (one minus accuracy), balanced accuracy, balanced total error (one minus balanced accuracy), F-score, and Matthews correlation coefficient (MCC). In this document, we review the definition of these metrics and compare them with the expected cost (EC), a metric introduced in every statistical learning course but rarely used in the machine learning literature. We show that the empirical estimate of the EC is a generalized version of both the total error and balanced total error. Further, we show its relation with F-score and MCC and argue that EC is superior to them, being more general, simpler, intuitive and well motivated. We highlight some issues with the F-score and the MCC that make them suboptimal metrics. While not explained in the current version of this manuscript, where we focus exclusively on metrics that are computed over hard decisions, the EC has the additional advantage of being a great tool to measure calibration of a system's scores and allows users to make optimal decisions given a set of posteriors for each class. We leave that discussion for a future version of this manuscript.
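The abstract's claim that the empirical expected cost (EC) generalizes both total error and balanced total error can be illustrated with a minimal sketch. It assumes the usual formulation, EC = sum over true classes i and decisions j of prior_i * cost_ij * R_ij, where R_ij is the fraction of class-i samples decided as class j. The function name `expected_cost` and the example confusion matrix are hypothetical and not taken from the paper.

```python
def expected_cost(confusion, priors=None, costs=None):
    """Empirical expected cost from a confusion matrix.

    confusion[i][j] is the number of class-i samples decided as class j.
    Assumes every class has at least one sample.
    priors defaults to the empirical class frequencies; costs defaults to
    0-1 cost (0 on the diagonal, 1 elsewhere).
    """
    k = len(confusion)
    class_totals = [sum(row) for row in confusion]
    n = sum(class_totals)
    if priors is None:
        priors = [t / n for t in class_totals]
    if costs is None:
        costs = [[0.0 if i == j else 1.0 for j in range(k)] for i in range(k)]
    ec = 0.0
    for i in range(k):
        for j in range(k):
            rate = confusion[i][j] / class_totals[i]  # R_ij
            ec += priors[i] * costs[i][j] * rate
    return ec


# Hypothetical two-class example: 100 samples of class 0, 50 of class 1.
confusion = [[80, 20],
             [5, 45]]

# 0-1 costs with empirical priors reduce to total error (25 errors / 150).
total_error = expected_cost(confusion)

# 0-1 costs with uniform priors reduce to balanced total error.
balanced_error = expected_cost(confusion, priors=[0.5, 0.5])
```

Passing a non-uniform `costs` matrix or other `priors` gives the more general metric; the two special cases above only change the priors.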