'Artificial Intelligence' 카테고리의 글 목록

Notice

Recent Posts

Recent Comments

Link

« 2025/05 »
일	월	화	수	목	금	토
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Tags more

Archives

Today

Total

관리 메뉴

목록Artificial Intelligence (64)

juooo1117

Source-free Adaptive Gaze Estimation by Uncertainty Reduction

Paper ReviewSource-free Adaptive Gaze Estimation by Uncertainty Reduction(Accepted by CVPR 2023)https://ieeexplore.ieee.org/document/10204815AbstractWe present an unsupervised source-free domain adpatation approach for gaze estimation, which adapts a a source-trained gaze estimator to unlabeled target domains without source data. (도메인 차이를 해결하려는 접근으로써, 이 방식은 소스 데이터 없이도 타겟 도메인에 적응할 수 있도록 한다.)We pr..

Artificial Intelligence/Research Paper 2025. 3. 25. 20:16

RT-GENE: Real-Time Eye Gaze Estimation in Natural Environments

Paper ReviewRT-GENE: Real-Time Eye Gaze Estimation in Natural Environments(Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 334-352)https://openaccess.thecvf.com/content_ECCV_2018/papers/Tobias_Fischer_RT-GENE_Real-Time_Eye_ECCV_2018_paper.pdfAbstractTwo main shortfalls in sota methos for gaze estimation:hindered ground truth gaze annotation and diminished gaze estimat..

Artificial Intelligence/Research Paper 2025. 3. 20. 23:40

MoST: Motion Style Transformer between Diverse Action Contents

Paper ReviewMoST: Motion Style Transformer between Diverse Action Contents(Accepted by CVPR 2024)https://doi.org/10.48550/arXiv.2403.06225AbstractLimitations of existing methods:Existing motion style transfer methods are effective between two motions with identical content, their performance significantly diminishes when transferring style between motions with different contents.(motion 에서는 cont..

Artificial Intelligence/Research Paper 2025. 3. 19. 06:48

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs

Paper ReviewEyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMshttps://arxiv.org/abs/2401.06209https://tsb0601.github.io/mmvp_blog/AbstractMultimodal Large Language Models(MLLMs) primarily rely on the reasoning capabilities of LLMs.**MLLMs 는 이미지를 대형 언어 모델 LLMs 에 통합해서, LLM의 능력을 활용하며, image understanding, visual question and answering, instruction following 같은 작업에서 뛰어난 성능을 보여준다. T..

Artificial Intelligence/Research Paper 2025. 3. 1. 04:54

Cross domain - Medical Image Segmentation

Cross-modality medical image segmentation서로 다른 종류의 의료 영상을 활용해서(e.g., MRI, CT, X-ray ...) 분석하는 방법으로, 다른 모달리티 간의 차이를 극복하고 공통된 특징을 추출하는 것이 중요하다. 한 가지 모달리티 만으로는 충분히 정확해지기 어려우므로, 다양한 의료 영상을 결합하면 더 정확한 진단이 가능하므로 이 방법을 사용할 수 있다. Research PaperMutual Information-based Disentangled Neural Networks for Classifying Unseen Categories in Different Domains: Application to Fetal Ultrasound Imaginghttps://www.n..

Artificial Intelligence/Research Paper 2025. 2. 3. 22:21

Cross-domain Image Analysis - Domain Shift

What is domain shift?Assumption of Neural Network :the training and test data are drawn from the same distribution. Domain shift :- corresponds to the feature distribution between training data and test data.- this problem is ubiquitous in many clinical application such as image classification and image segmentation.(e.g., performance degradation can be observed when applying a model that has be..

Artificial Intelligence/Research Paper 2025. 2. 3. 03:12

Vision-LSTM: xLSTM as Generic Vision Backbone

Vision-LSTMAn innovative architecture designed to handle CV tasks efficiently.An adaptation of a model originally designed for language processing, known as the LSTM. The extended version of this, called xLSTM, introduces several imporvemnets, making it scalable and efficient for various tasks, including CV. CV tasks have relied heavily on CNN and more recently, Transformers. However, each of th..

Artificial Intelligence/Research Paper 2024. 7. 6. 11:33

DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks

AbstractDeblurGAN : end-to-end learned method for motion deblurring. 학습은 conditional GAN, content loss 를 기반으로 진행된다. deblurring 모델의 품질은 실제 문제에서 새로운 방식으로 평가되며, 블러이미지에서 object detection을 통해 이루어진다. 이 방법은 최근의 경쟁자 Deep-Deblur 보다 5배 빠르다. 우리는 또한 선명한 이미지로부터 합성된 motion blurred images를 생성하는 새로운 방법을 소개하며, 이로 인해서 현실적인 데이터셋 증강이 가능해진다. 1. Introduction이 작업은 단일 사진의 blind motion deblurring에 관한 것이다. 최근 GAN을 적용한 이미..

Artificial Intelligence/Research Paper 2024. 5. 27. 14:56

Attention Is All You Need

Abstractdominant sequence transduction model 들은, 인코더와 디코더를 포함하는 recurrent or conv neural network에 기초한다. 가장 좋은 성능을 내는 모델들 또한 attention 메커니즘을 통해 인코더와 디코더로 연결되어있다. 우리는 새로운 단순한 네트워크 구조를 제안한다. ⇒ Transformer; based solely on attention mechanisms, dispensing with recurrence & conv entirely. 두 번역 실험에서 병렬화가 가능해지고, 상당히 적은 시간이 훈련에 필요했던 반면에 우수한 성능을 냈다. WMT 2014 영어-독일어 번역 태스크에서 28.4 BLEU(*Bilingual Evaluatio..

Artificial Intelligence/Research Paper 2024. 5. 20. 11:11

[Data Science Math Skills: Week4] Probability, Problem Solving Methods, Bayes' Theorem

Probability Probability : the degree of belief in the truth or falsity of a statement. Range of uncertainty from 0 to 1 - Certain statement is true : probability 1 - Certain statement is false : probability 0 Notation) 𝑃(𝑥) : probability of '𝑥' ~ 𝑥 : negation(부정) of statement '𝑥' Probability of a statement and the probability of the negation of a statement must sum to 1. If 𝑃(𝑥) = 1, then 𝑃(~𝑥) ..

Artificial Intelligence/Data Science Math Skills 2024. 4. 17. 15:34

이전 Prev 1 2 3 4 ··· 7 Next 다음

목록Artificial Intelligence (64)

juooo1117

티스토리툴바