'분류 전체보기' 카테고리의 글 목록 (5 Page)

[3D Pose and Shape] 3D Human Pose Visualizer

https://github.com/isarandi/poseviz GitHub - isarandi/poseviz: 3D Human Pose Visualizer for Python 3D Human Pose Visualizer for Python. Contribute to isarandi/poseviz development by creating an account on GitHub. github.com

AI Research Topic/3D Pose and Shape

npm ERR! To see a list of scripts, run

npm run dev 명령어를 쳤는데 아래와 같은 에러가 뜬다면 npm ERR! To see a list of scripts, run npm start npm start 명령어를 통해 package.json 파일에 시작 스크립트 추가 하면 된다. 참고자료 : https://sebhastian.com/npm-err-missing-script-start/ How to fix npm ERR! missing script: start error Learn how to fix the npm ERR missing script start error. sebhastian.com

삽질 기록

Parcel encountered errors

Parcel encountered errors Error: Expected content key (error key) to exist 자바스크립트에서 갑자기(는 아니고 다른 패키지를 import 못했을 때)위와 같은 에러가 날 때 루트 디렉토리에서 parcel-cache를 삭제해주면 된다. rm -r .parcel-cache 참고자료 : https://github.com/parcel-bundler/parcel/issues/8874 parcel watch error: Expected content key 2d39cdf7c618ab5b to exist · Issue #8874 · parcel-bundler/parcel 🐛 bug report When running parcel watch 'packages/*..

삽질 기록

선형대수 정리 잘 되어있는 사이트

https://angeloyeo.github.io/2020/09/07/basic_vector_operation.html 벡터의 기본 연산(상수배, 덧셈) - 공돌이의 수학정리노트 (Angelo's Math Notes) angeloyeo.github.io

기타/참고자료

[Transformer] Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

Overview 얼마전 Google DeepMind에서 "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution" 라는 논문이 나왔습니다. 기존 컴퓨터 비전 모델에서는 이미지를 처리하기 전에 고정된 해상도로 이미지 크기를 설정하게 되고, ViT(Vision Transformer)와 같은 모델은 flexible sequence-based modeling을 하기 때문에 다양한 input sequence length를 제공하게 됩니다. 본 논문에서는 임의의 해상도와 종횡비(aspect ratio)를 처리하기 위해 학습 중에 sequence packing을 사용하는 NaViT(Native Resolution ViT)를 제안합니..

AI Research Topic/Transformer

[티스토리] 관리자 메뉴 진입 단축키

티스토리 스킨을 바꾸다보면 관리자 메뉴 버튼이 없는 스킨이 가끔 있다. 그럴 땐 본인 티스토리 메인 페이지에서 Ctrl + Q 를 입력하면 바로 관리자 메뉴로 진입이 가능하다.

기타/티스토리

[Book Review] ChatGPT 책 추천 - 인공지능 전문가가 알려 주는 챗GPT로 대화하는 기술

인공지능을 개발하는 입장에서 요즘 느끼는 것은 인공지능이 예전보다 훨씬 대중화 되었고, 인공지능을 개발하거나 서비스를 만드는데 있어서 진입장벽이 훨씬 낮아졌다는 생각이 많이 든다. 그 이유는 바로 ChatGPT, Diffusion 기술 때문이다. ChatGPT 또는 Diffusion 기술들을 이용하여 서비스를 만들거나 창작물을 만들어 낼 때 전문가만큼의 지식이나 기술, 배경, 원리를 알지 못하더라도 서비스를 사용하거나 직접 만들어낼 수 있는 세상이 왔다. 어쩌면 앞으로는 이러한 기술의 원리를 알지 못하더라도 서비스를 사용하는데 상관없어지지 않을까 싶다. 개인적인 생각으로는 물론 엔지니어 입장에서 기술의 배경을 아는 것과 모르는 것의 차이는 크다고 생각하고, 아는 것이 필요하다고 생각하지만, 결국 이런 기..

Book Review

[Backbone] DreamTeacher: Pretraining Image Backbones with Deep Generative Models

Overview 오늘 리뷰할 논문은 NVIDIA, University of Toronto, Vector Institute, MIT에서 낸 "DreamTeacher: Pretraining Image Backbones with Deep Generative Models" 입니다. ICCV 2023에 채택되었네요. 이 논문에서는 downstream image backbone을 사전 학습 하기 위해 generative network를 활용하는 DreamTeacher 기법을 소개합니다. 즉, 생성 모델의 knowledge를 특정 인식 작업을 위해 잘 설계된 standard image backbone으로 추출하는거죠. 어떻게 보면 역으로 사용했다고 보면 되네요. 먼저 label-efficient semantic se..

AI Research Topic/Backbone

[Diffusion] DISCO: Disentangled Control for Referring Human Dance Generation in Real World

Project page : https://disco-dance.github.io/ DISCO: Disentangled Control for Referring Human Dance Generation in Real World Generative AI has made significant strides in computer vision, particularly in image/video synthesis conditioned on text descriptions. Despite the advancements, it remains challenging especially in the generation of human-centric content such as dance synt disco-dance.gith..

AI Research Topic/Diffusion

[Model Optimization] Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations

Paper : https://arxiv.org/abs/2304.11267 Blog : https://ai.googleblog.com/2023/06/speed-is-all-you-need-on-device.html Speed is all you need: On-device acceleration of large diffusion models via GPU-aware optimizations Posted by Juhyun Lee and Raman Sarokin, Software Engineers, Core Systems & Experiences The proliferation of large diffusion models for image generation has led to a significant in..

AI Research Topic/Model Optimization

[Diffusion] 생성모델에서의 Latent Space, Latent Space Vector 개념

생성 모델은 해당 데이터의 분포를 따르는 기존에는 없던 새로운 데이터를 생성해내는 모델이다. 생성모델은 데이터의 분포를 학습하게 되는데, 데이터는 Latent Space를 가지고 있으며 학습 할 때 이 Latent Space를 학습하는 것이라고 할 수 있다. 참고로 latent는 데이터가 가지고 있는 잠재적인 변수를 뜻한다. Latent Vector는 한 이미지가 가지고 있는 잠재적인 벡터 형태의 변수이고, Latent Vector들이 모여서 latent space가 형성이 되는 것이다. Latent Space에는 우리가 학습시킬 이미지들이 Latent Vector의 분포 형태로 존재하게 되고 모델을 통해서 이미지가 가지고 있는 Latent의 분포를 학습하게 된다. 기존의 이미지 분류, 탐지 모델들은 이..

AI Research Topic/Diffusion

[Diffusion] ControlNet, Adding Conditional Control to Text-to-Image Diffusion Models

오늘 리뷰할 논문은 ControlNet 입니다. Stable Diffusion 구조를 이용하여 specific-task condition을 학습 할 수 있도록 한 end-to-end 모델입니다. ControlNet의 핵심은 Zero Convolution이라고 할 수 있겠네요. 이번 리뷰 논문은 pdf로 작업했기 때문에 이미지로 대체하여 업로드 합니다. 🫶

AI Research Topic/Diffusion

Prev 1 2 3 4 5 6 7 8 ··· 59 Next

티스토리툴바