Existing automatic data augmentation (DA) methods either ignore updating DA's
parameters according to the target model's state during training or adopt
update strategies that are not effective enough.
Neural network models have demonstrated impressive performance in predicting
pathologies and outcomes from the 12-lead electrocardiogram (ECG). However,
these models often need to be trained with larg
In this paper, we introduce Random Erasing, a new data augmentation method
for training the convolutional neural network (CNN). In training, Random
Erasing randomly selects a rectangle region in an im
The automatic detection of hypernymy relationships represents a challenging
problem in NLP. The successful application of state-of-the-art supervised
approaches using distributed representations has g
Interdisciplinary research is often at the core of scientific progress. This
dissertation explores some advantageous synergies between machine learning,
cognitive science and neuroscience. In particul
Data augmentation policies drastically improve the performance of image recognition tasks, especially when the policies are optimized for the target data and tasks. In this paper, we propose to optimi
Data augmentation is a key practice in machine learning for improving generalization performance. However, finding the best data augmentation hyperparameters requires domain knowledge or a computation
In many applications of machine learning, certain categories of examples may
be underrepresented in the training data, causing systems to underperform on
such "few-shot" cases at test time. A common r
Data augmentation is a commonly applied technique with two seemingly related
advantages. With this method one can increase the size of the training set
generating new samples and also increase the inv
Data augmentation has led to substantial improvements in the performance and
generalization of deep models, and remain a highly adaptable method to evolving
model architectures and varying amounts of
In this paper, we propose a novel graph-based data augmentation method that
can generally be applied to medical waveform data with graph structures. In the
process of recording medical waveform data,
Data augmentation (DA) techniques aim to increase data variability, and thus train deep networks with better generalisation. The pioneering AutoAugment automated the search for optimal DA policies wit
Recent advances in commonsense reasoning depend on large-scale
human-annotated training data to achieve peak performance. However, manual
curation of training examples is expensive and has been shown
We introduce style augmentation, a new form of data augmentation based on
random style transfer, for improving the robustness of convolutional neural
networks (CNN) over both classification and regres
We study the effect of seven data augmentation (da) methods in factoid question answering, focusing on the biomedical domain, where obtaining traininginstances is particularly difficult.
We experiment with data from the bioasqchallenge, which we augment with training instances obtained from an artificial machine reading comprehension dataset, or via back-translation, information retrieval, word substitution based on word2vec embeddings, or masked language modeling, question generation, or extending the given passagewith additional context.
Semi-supervised learning lately has shown much promise in improving deep
learning models when labeled data is scarce. Common among recent approaches is
the use of consistency training on a large amoun