saas-things (SaaS Things)

posted an update 7 days ago

Updated the demo for the new version of the W2V-BERT model for Ukrainian speech recognition.

This is a classic Automatic Speech Recognition or Speech to Text task.

What's new in version three:

• more data: 1200 hours
• new SentencePiece tokenizer with 512 tokens
• feature extraction is done via a Rust extension

Facts:

• Training was started from the previous model to speed up the learning process.
• Training takes place on two 3090 video cards with 24 GB each.
• It is well suited for fine-tuning because the training data is very diverse and mostly noisy.

You can try it here:

Yehor/w2v-bert-uk-v3

Download weights here:

speech-uk/w2v-bert-v3

If you wish to support the speech-uk initiative with a donation, here is the link to Monobank:

https://send.monobank.ua/jar/3Saxixsdua

Yehor

posted an update 3 months ago

A useful tool for everyone who works with audio datasets: https://github.com/RustedBytes/data-viewer-audio

Yehor

posted an update 6 months ago

Added an Apptainer image to Kulyk:

Yehor/kulyk-sif

Yehor

posted an update 6 months ago

If you work with audio ML, take a look at https://github.com/RustedBytes/wav-files-toolkit

Yehor

posted an update 6 months ago

Containerized Yehor/kulyk-en-uk and Yehor/kulyk-uk-en, so you can just pull an image and run the CPU version to do machine translation:

docker run -p 3000:3000 --rm ghcr.io/egorsmkv/kulyk-rust:latest

Yehor

updated 3 Spaces 7 months ago

Metrics Explained

📈

SaaS metrics with detailed explanations

Metrics Analyzer with Visualizations

🌍

Analyze SaaS metrics and generate visual reports

Valuation Calculator

📊

Calculate how much your SaaS costs

Yehor

published 2 Spaces 7 months ago

Valuation Calculator

📊

Calculate how much your SaaS costs

Metrics Explained

📈

SaaS metrics with detailed explanations

Yehor

updated 2 Spaces 7 months ago

Fake Data Generator

📚

Generate fake SaaS financial data for a given date range

Metrics Calculator

⚡

Calculate key SaaS metrics for business valuation and health

Yehor

published 3 Spaces 7 months ago

Metrics Analyzer with Visualizations

🌍

Analyze SaaS metrics and generate visual reports

Fake Data Generator

📚

Generate fake SaaS financial data for a given date range

Metrics Calculator

⚡

Calculate key SaaS metrics for business valuation and health

Yehor

posted an update 9 months ago

A new lightweight model for machine translation from English to Ukrainian, built on the recently published LFM2 model. Use the demo Yehor/en-uk-translator to test it.

Facts:
- Fine-tuned for 1.4 epochs on 40M samples, selected by a quality metric from ~53.5M
- 354M params
- Requires 1 GB of RAM to run with bf16
- BLEU on FLORES-200: 27.24
- Tokens per second: 229.93 (bs=1), 1664.40 (bs=10), 8392.48 (bs=64)
- License: lfm1.0
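
The figures listed above can be sanity-checked with a little arithmetic. The sketch below is only a back-of-the-envelope check, not the author's benchmark code: the function names are hypothetical, and it assumes that the reported tokens per second are aggregate numbers across the whole batch and that the memory figure is dominated by the weights.

```python
# Back-of-the-envelope checks for the numbers quoted in the post.
# All names here are illustrative, not taken from the model's code.

PARAMS = 354e6        # parameter count from the post
BYTES_PER_BF16 = 2    # bf16 stores each value in 16 bits

# Tokens per second from the post, keyed by batch size.
# Assumption: these are aggregate rates over the whole batch.
THROUGHPUT = {1: 229.93, 10: 1664.40, 64: 8392.48}


def weight_memory_gb(params: float, bytes_per_param: int) -> float:
    """Memory taken by the weights alone, in decimal gigabytes."""
    return params * bytes_per_param / 1e9


def speedup_over_single(throughput: dict) -> dict:
    """Aggregate throughput relative to batch size 1."""
    base = throughput[1]
    return {bs: tps / base for bs, tps in throughput.items()}


def per_sequence_rate(throughput: dict) -> dict:
    """Tokens per second seen by each sequence in the batch."""
    return {bs: tps / bs for bs, tps in throughput.items()}


print(f"weights in bf16: {weight_memory_gb(PARAMS, BYTES_PER_BF16):.3f} GB")
for bs, ratio in speedup_over_single(THROUGHPUT).items():
    rate = per_sequence_rate(THROUGHPUT)[bs]
    print(f"bs={bs}: {ratio:.2f}x aggregate, {rate:.2f} tokens/s per sequence")
```

Under these assumptions the weights take roughly 0.71 GB, which fits the stated 1 GB requirement with some room for activations. Batching raises total throughput considerably while each individual sequence decodes somewhat slower.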

Model page: Yehor/kulyk-en-uk

Yehor

posted an update about 1 year ago

Esoteric practices: running model inference in PHP!

Repository: https://github.com/egorsmkv/speech-to-text-using-php

Yehor

posted an update about 1 year ago

Made a working Rust program that uses the IREE runtime to run inference with the wav2vec2-bert model for Automatic Speech Recognition.

Yehor

posted an update about 1 year ago

I have made a Rust project that integrates the latest state-of-the-art model for object detection. It outperforms YOLO!

Check it out: https://github.com/egorsmkv/rf-detr-usls

Yehor

posted an update about 1 year ago

Convert your audio data to Parquet/DuckDB files with blazingly fast speeds!

Repository with pre-built binaries: https://github.com/crs-org/audios-to-dataset
