아이공의 AI 공부 도전기

최신 글

[논문 Summary] DreaMoving (2023.12 Arxiv) "DreaMoving: A Human Video Generation Framework based on Diffusion Models"

[논문 Summary] DreaMoving (2023.12 Arxiv) "DreaMoving: A Human Video Generation Framework based on Diffusion Models" 목차 논문 정보 Citation : 2024.03.12 화요일 기준 1회 저자 Mengyang Feng, Jinlin Liu, Kai Yu, Yuan Yao, Zheng Hui, Xiefan Guo, Xianhui Lin, Haolan Xue, Chen Shi, Xiaowen Li, Aojie Li, Xiaoyang Kang, Biwen Lei, Miaomiao Cui, Peiran Ren, Xuansong Xie - Alibaba Group 논문 & Github 링크 Official Arxiv htt..
Generative Model
2024.03.12

[논문 Summary] Animate Anyone (2023.11 Arxiv) "Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation"

[논문 Summary] Animate Anyone (2023.11 Arxiv) "Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation" 목차 논문 정보 Citation : 2024.03.11 월요일 기준 10회 저자 Li Hu, Xin Gao, Peng Zhang, Ke Sun, Bang Zhang, Liefeng Bo - Institute for Intelligent Computing, Alibaba Group 논문 & Github 링크 Official Not Yet Arxiv https://arxiv.org/abs/2311.17117 Animate Anyone: Consistent and ..
Generative Model
2024.03.11

[논문 Summary] SDXL-Turbo (ADD) (2023.11 Arxiv) "Adversarial Diffusion Distillation"

[논문 Summary] SDXL-Turbo (ADD) (2023.11 Arxiv) "Adversarial Diffusion Distillation" 목차 논문 정보 Citation : 2024.02.03 토요일 기준 3회 저자 Axel Sauer, Dominik Lorenz, Andreas Blattmann, Robin Rombach - Stability AI 논문 & Github 링크 Official https://stability.ai/news/stability-ai-sdxl-turbo Introducing SDXL Turbo: A Real-Time Text-to-Image Generation Model — Stability AI SDXL Turbo is a new text-to-image mode ..
Generative Model
2024.02.03

[논문 Summary] Nerfies (2021 ICCV) "Nerfies: Deformable Neural Radiance Fields"

[논문 Summary] Nerfies (2021 ICCV) "Nerfies: Deformable Neural Radiance Fields" 목차 논문 정보 Citation : 2024.01.27 토요일 기준 871회 저자 Keunhong Park, Utkarsh Sinha, Jonathan T. Barron, Sofien Bouaziz, Dan B Goldman, Steven M. Seitz, Ricardo Martin-Brualla - University of Washington, Google Research 논문 링크 Official https://openaccess.thecvf.com/content/ICCV2021/papers/Park_Nerfies_Deformable_Neural_Radiance_..
3D
2024.01.27

[논문 Summary] NeRF-W (2021 CVPR) "NeRF in the Wild : Neural Radiance Fields for Unconstrained Photo Collections"

[논문 Summary] NeRF-W (2021 CVPR) "NeRF in the Wild : Neural Radiance Fields for Unconstrained Photo Collections" 목차 논문 정보 Citation : 2024.01.14 일요일 기준 922회 저자 Ricardo Martin-Brualla, Noha Radwan, Mehdi S. M. Sajjadi, Jonathan T. Barron, Alexey Dosovitskiy, Daniel Duckworth Google Research 논문 링크 Official https://openaccess.thecvf.com/content/CVPR2021/papers/Martin-Brualla_NeRF_in_the_Wild_Neural_R..
3D
2024.01.21

[논문 Summary] Score based generative model with SDE (2021 ICLR) "Score-Based Generative Modeling through Stochastic Differential Equations"

[논문 Summary] Score based generative model with SDE (2021 ICLR) "Score-Based Generative Modeling through Stochastic Differential Equations" 목차 논문 정보 Citation : 2023.12.03 토요일 기준 2072회 저자 Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, Ben Poole 논문 링크 Official https://openreview.net/forum?id=CzceR82CYc Score-Based Generative Modeling with Critically-Damped Lang..
카테고리 없음
2024.01.16

[논문 Summary] D-NeRF (2021 CVPR) "D-NeRF: Neural Radiance Fields for Dynamic Scenes"

[논문 Summary] D-NeRF (2021 CVPR) "D-NeRF: Neural Radiance Fields for Dynamic Scenes" 목차 논문 정보 Citation : 2024.01.07 일요일 기준 770회 저자 Albert Pumarola, Enric Corona, Gerard Pons-Moll, Francesc Moreno-Noguer 논문 링크 Official https://openaccess.thecvf.com/content/CVPR2021/papers/Pumarola_D-NeRF_Neural_Radiance_Fields_for_Dynamic_Scenes_CVPR_2021_paper.pdf Arxiv https://arxiv.org/abs/2011.13961 공식 Github ..
3D
2024.01.07

[논문 Summary] Tune-A-Video (2023 ICCV) "Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation"

[논문 Summary] Tune-A-Video (2023 ICCV) "Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation" 목차 논문 정보 Citation : 2024.01.14 일요일 기준 192회 저자 Jay Zhangjie Wu, Yixiao Ge, Xintao Wang, Weixian Lei, Yuchao Gu, Yufei Shi, Wynne Hsu, Ying Shan, Xiaohu Qie, Mike Zheng Shou 1) Show Lab, National University of Singapore 2) ARC Lab 3) Tencent PCG 4) School of Computing, Natio..
Generative Model
2023.11.10

[논문 Summary] AnimateDiff (2023.07 Arxiv) "AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Modelswithout Specific Tuning"

[논문 Summary] AnimateDiff (2023.07 Arxiv) "AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning" 목차 논문 정보 Citation : 2023.10.28 토요일 기준 4회 저자 Yuwei Guo, Ceyuan Yang, Anyi Rao, Yaohui Wang, Yu Qiao, Dahua Lin, Bo Dai Shanghai AI Laboratory, The Chinese University of Hong Kong, Stanford University 논문 링크 Official https://arxiv.org/abs/2307.04725 Arxiv 공식 Githu..
Generative Model
2023.10.28

[Pytorch] 새로운 축 만들기 방법 (torch.unsqueeze, input[:, None])

torch tensor에서 새로운 축을 만들고자 할 때 사용할 수 있는 방법 1. torch.unsqueeze (input, dim) ex) torch.unsqueeze(input, dim=0) https://pytorch.org/docs/stable/generated/torch.unsqueeze.html torch.unsqueeze — PyTorch 2.1 documentation Shortcuts pytorch.org 2. input[:, None] numpy 에서도 비슷하게 사용가능 input[None, :] input[:, None] input[:, None, None] https://stackoverflow.com/questions/37867354/in-numpy-what-does-selecti..
Pytorch
2023.10.21

[논문 Summary] MAE(Masked AutoEncoder) (2022 CVPR) "Masked Autoencoders Are Scalable Vision Learners"

[논문 Summary] MAE(Masked AutoEncoder) (2022 CVPR) "Masked Autoencoders Are Scalable Vision Learners" 목차 논문 정보 Citation : 2023.10.02 월요일 기준 2950회 저자 Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, Ross Girshick Meta (FAIR) 논문 링크 Official https://openaccess.thecvf.com/content/CVPR2022/papers/He_Masked_Autoencoders_Are_Scalable_Vision_Learners_CVPR_2022_paper.pdf Arxiv https://arxiv...
Self-Supervised Learning
2023.10.08

[해결] ImportError: cannot import name 'Literal' from 'typing'

ImportError: cannot import name 'Literal' from 'typing' 문제가 생기면 Python Version 확인 먼저 해야함. typing에서 Literal은 3.8 이상에서 사용가능한 package이므로 Python version 3.7 이하의 경우 실행 불가 에러가 뜸 고로, terminal에 'python --version'을 통해 확인 혹은 import sys sys.version 으로 Python 버전을 확인 먼저 진행 만약 Python version이 3.7이하라면 1) kernel 재설치 만약 terminal env로 재설치가 가능하다면 python = 3.8 이상 환경 구축 후 활용 2) typing_extensions package 활용 pip insta..
Python Bug Solution
2023.10.06

[논문 Summary] PerSAM (2023.05 arxiv) "Personalize Segment Anything Model with One Shot"

[논문 Summary] PerSAM (2023.05 arxiv) "Personalize Segment Anything Model with One Shot" 목차 논문 정보 Citation : 2023.06.30 금요일 기준 4회 저자 Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li 논문 링크 Official Arxiv https://arxiv.org/abs/2305.03048 Personalize Segment Anything Model with One Shot Driven by large-data pre-training, Segment Anything Model (SAM) ha..
Generative Model
2023.08.31

[논문 Summary] Animated Drawings (2023 SIGGRAPH) "A Method for Animating Children's Drawings of the human Figure"

[논문 Summary] Animated Drawings (2023 SIGGRAPH) "A Method for Animating Children's Drawings of the human Figure" 목차 논문 정보 Citation : 저자 Harrison Jesse Smith, Qingyuan Zheng, Yifei Li, Somya Jain, Jessica K. Hodgins Meta, Tencent, MIT CSAIL, Carnegie Mellon University 논문 링크 Official https://dl.acm.org/doi/10.1145/3592788 Arxiv https://arxiv.org/abs/2303.12741 A Method for Animating Children's Draw..
Deep Learning
2023.08.30

[논문 Summary] DreamPose (2023.04 arxiv) "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"

[논문 Summary] DreamPose (2023.04 arxiv) "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion" 목차 논문 정보 Citation : 2023.08.30 수요일 기준 4회 저자 Johanna Karras, Aleksander Holynski, Ting-Chun Wang, Ira Kemelmacher-Shlizerman 논문 링크 Official https://grail.cs.washington.edu/projects/dreampose/ DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion Given an image of a person and a seq..
Generative Model
2023.08.30

[논문 Summary] ControlNet (2023.02 arxiv) "Adding Conditional Control to Text-to-Image Diffusion Models"

[논문 Summary] ControlNet (2023.02 arxiv) "Adding Conditional Control to Text-to-Image Diffusion Models" 목차 논문 정보 Citation : 2023.05.14 토요일 기준 56회 저자 Lvmin Zhang, Maneesh Agrawala Stanford University 논문 링크 Official Arxiv https://arxiv.org/abs/2302.05543 논문 Summary Abstract 0. 정리 ControlNet은 Stable Diffusion과 같은 pretrained large diffusion model에서 추가적인 input condition에 대한 추가 학습을 시키는 것이 작은 데이터 세트 (< ..
Generative Model
2023.06.11

[논문 Summary] InstructPix2Pix (2023 CVPR) "InstructPix2Pix: Learning to Follow Image Editing Instructions"

[논문 Summary] InstructPix2Pix (2023 CVPR) "InstructPix2Pix: Learning to Follow Image Editing Instructions" 목차 논문 정보 Citation : 2023.05.01 월요일 기준 34회 저자 Tim Brooks, Aleksander Holynski, Alexei A. Efros - University of California, Berkeley 논문 링크 Official Arxiv https://arxiv.org/abs/2211.09800 InstructPix2Pix: Learning to Follow Image Editing Instructions We propose a method for editing images from ..
Generative Model
2023.05.14

[논문 Summary] DiffEdit (2022.10 arxiv) "DiffEdit: Diffusion-based semantic image editing with mask guidance"

[논문 Summary] DiffEdit (2022.10 arxiv) "DiffEdit: Diffusion-based semantic image editing with mask guidance" 목차 논문 정보 Citation : 2023.04.24 월요일 기준 32회 저자 Guillaume Couairon, Jakob Verbeek, Holger Schwenk (Meta AI), Matthieu Cord(Sorbonne Universit´e, Valeo.ai) 논문 링크 Official Not Yet Arxiv https://arxiv.org/abs/2210.11427 DiffEdit: Diffusion-based semantic image editing with mask guidance Image ge..
Generative Model
2023.04.24

[논문 Summary] CharNet (2019 ICCV) "Convolutional Character Networks"

[논문 Summary] CharNet (2019 ICCV) "Convolutional Character Networks" 목차 논문 정보 Citation : 2023.04.23 일요일 기준 125회 저자 Linjie Xing, Zhi Tian, Weilin Huang, Matthew R. Scott 논문 링크 Official https://openaccess.thecvf.com/content_ICCV_2019/html/Xing_Convolutional_Character_Networks_ICCV_2019_paper.html Arxiv https://arxiv.org/abs/1910.07954 논문 Summary Abstract 0. 설명 시작 전 Overview 1. Introduction 이미지에서 te..
Object Detection
2023.04.23

[논문 Summary] Deformable DETR (2021 ICLR Oral) "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

[논문 Summary] Deformable DETR (2021 ICLR Oral) "Deformable DETR: Deformable Transformers for End-to-End Object Detection" 목차 논문 정보 Citation : 2022.12.15 토요일 기준 1369회 저자 Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, Jifeng Dai - SenseTime Research, University of Science and Technology of China, Chinese University of Hong Kong 논문 링크 Official https://openreview.net/forum?id=gZ9hCDWe6ke Def..
Object Detection
2023.04.19

[논문 Summary] FamNet (2021 CVPR) "Learning To Count Everything"

[논문 Summary] FamNet (2021 CVPR) "Learning To Count Everything" 목차 논문 정보 Citation : 2023.02.11 토요일 기준 24회 저자 Viresh Ranjan, Udbhav Sharma, Thu Nguyen, Minh Hoai 논문 링크 Official https://openaccess.thecvf.com/content/CVPR2021/papers/Ranjan_Learning_To_Count_Everything_CVPR_2021_paper.pdf Arxiv https://arxiv.org/abs/2104.08391 Learning To Count Everything Existing works on visual counting primarily f..
Deep Learning
2023.03.04

[논문 Summary] CFOCNet (2021 WACV) "Class-agnostic Few-shot Object Counting"

[논문 Summary] CFOCNet (2021 WACV) "Class-agnostic Few-shot Object Counting" 목차 논문 정보 Citation : 2023.02.03 금요일 기준 21회 저자 Shuo-Diao Yang, Hung-Ting Su, Winston H. Hsu, Wen-Chin Chen 논문 링크 Official https://openaccess.thecvf.com/content/WACV2021/papers/Yang_Class-Agnostic_Few-Shot_Object_Counting_WACV_2021_paper.pdf Arxiv 논문 Summary Abstract 0. 설명 시작 전 Overview Few shot learning을 활용한 object counting..
Deep Learning
2023.02.28

[논문 Summary] SAFECount(2023 WACV) "Few-shot Object Counting with Similarity-Aware Feature Enhancement"

[논문 Summary] SAFECount(2023 WACV) "Few-shot Object Counting with Similarity-Aware Feature Enhancement" 목차 논문 정보 Citation : 2023.02.03 금요일 기준 2회 저자 Zhiyuan You, Kai Yang, Wenhan Luo, Xin Lu, Lei Cui, Xinyi Le 논문 링크 Official https://openaccess.thecvf.com/content/WACV2023/papers/You_Few-Shot_Object_Counting_With_Similarity-Aware_Feature_Enhancement_WACV_2023_paper.pdf Arxiv https://arxiv.org/abs/22..
Deep Learning
2023.02.28

[논문 Summary] CSPNet (2020 CVPR) "CSPNet: A New Backbone that can Enhance Learning Capability of CNN"

[논문 Summary] CSPNet (2020 CVPR) "CSPNet: A New Backbone that can Enhance Learning Capability of CNN" 목차 논문 정보 Citation : 2023.01.15 일요일 기준 1693회 저자 Chien-Yao Wang, Hong-Yuan Mark Liao, I-Hau Yeh, Yueh-Hua Wu, Ping-Yang Chen, Jun-Wei Hsieh - Taiwan, Institute of Information Science Academia Sinica, National Chiao Tung University 논문 링크 Official https://openaccess.thecvf.com/content_CVPRW_2020/pape..
Deep Learning
2023.01.15

[논문 Summary] CutMix (2019 ICCV) "CutMix : Regularization Strategy to Train Strong Classifiers with Localizable Features"

[논문 Summary] CutMix (2019 ICCV) "CutMix : Regularization Strategy to Train Strong Classifiers with Localizable Features" 목차 논문 정보 Citation : 2023.01.14 토요일 기준 2472회 저자 Sangdoo Yun, Dongyoon Han, Seong Joon Oh, Sanghyuk Chun, Junsuk Choe, Youngjoon Yoo Naver Clova AI Research 논문 링크 Official https://openaccess.thecvf.com/content_ICCV_2019/papers/Yun_CutMix_Regularization_Strategy_to_Train_Strong_C..
논문 Summary
2023.01.14
loading