-
so i have studyed your code.
it is very interesting as replication of INSWAPPER, but right now it is only distillation of inswapper. this will limit this model.
we can use this replication as initia…
-
https://github.com/pulsejet/memories/tree/master/src/native.ts#L16
https://github.com/pulsejet/memories/tree/master/src/components/viewer/Viewer.vue#L808
I have a feature idea for the app:
IMAGE_HQ…
-
# System specs
- Windows 10
- AMD 6600
- 32 Gb Ram
# Settings
- Output resolution 1280 x 960
- RealESR_Gx4
- GPU Auto
- AI Multithreading 1
- GPU VRAM 8Gb
- Video Output x264
- AI Interp…
-
-
### 🚀 The feature
Implement a GitHub Actions workflow to automate the build and release process for the Flutter application. This workflow will run automated tests on each code push to ensure code qu…
-
# Goal
Replace existing TTS cascade with a speech decoder that directly generates speech. This change will replace the current TTS cascade which adds latency to ichigo's response time.
# Potential So…
-
While tinkering with the encoding APIs in zune-image, I found an oddity in the type `EncoderOptions`. Documentation for the [getter](https://docs.rs/zune-core/0.4.0/zune_core/options/struct.EncoderOpt…
Enet4 updated
1 month ago
-
# Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings · AI Paper Reviews by AI
WaLa: a billion-parameter 3D generative model using wavelet encodings …
-
Hi Vijay Thakkar!
Looks like you've continued to work a lot on `tensorflow-wavenet`. Do you mind posting some samples?
Thanks!
-
EchoMimicV2 utilizes a reference image, an audio clip, and a sequence of hand pose to generate a high-quality animation video,
ensuring coherence between audio content and half-body movements.
Proje…