Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
8.5
TFLOPS
12
13
135
Clelia Astra Bertelli
as-cle-bert
Follow
adamm-hf's profile picture
richardsontm's profile picture
jerryrt's profile picture
2,717 followers
ยท
40 following
https://www.clelia.dev
itsclelia
AstraBert
clelia-astra-bertelli-583904297
cle-does-things.bsky.social
AI & ML interests
Biology + Artificial Intelligence = โค๏ธ | AI for sustainable development, sustainable development for AI | Researching on Machine Learning Enhancement | I love automation for everyday things | Blogger | Open Source
Recent Activity
liked
a dataset
2 days ago
llamaindex/liteparse_bench_small
liked
a model
7 months ago
facebook/dinov2-small
posted
an
update
about 1 year ago
Let's pipe some ๐ฑ๐ฎ๐๐ฎ ๐ณ๐ฟ๐ผ๐บ ๐๐ต๐ฒ ๐๐ฒ๐ฏ into our vector database, shall we?๐ค With ๐ข๐ง๐ ๐๐ฌ๐ญ-๐๐ง๐ฒ๐ญ๐ก๐ข๐ง๐ ๐ฏ๐.๐.๐ (https://github.com/AstraBert/ingest-anything) you can now scrape content simply starting from URLs, extract the text from it, chunk it and put it into your favorite LlamaIndex-compatible database!๐ธ๏ธ You can do it thanks to ๐ฐ๐ฟ๐ฎ๐๐น๐ฒ๐ฒ by Apify, an open-source crawling library for python and javascript that handles all the data flow from the web: ingest-anything then combines it with ๐๐ฒ๐ฎ๐๐๐ถ๐ณ๐๐น๐ฆ๐ผ๐๐ฝ, ๐ฃ๐ฑ๐ณ๐๐๐๐ผ๐๐ป and ๐ฃ๐๐ ๐๐ฃ๐ฑ๐ณ to scrape HTML files, convert them to PDF and extract the text - hassle-free!๐ธ Check the attached code snippet if you're curious of knowing how to get started๐ฌ PS: Don't tell anybody, but this release also has another gem... It supports OpenAI models for agentic chunking, following the new releases of Chonkie๐ฆโจ If you don't want to miss out on the new features, leave us a little star on GitHub โก๏ธ https://github.com/AstraBert/ingest-anything And join our discord community! โก๏ธ https://discord.gg/kDqHNjks
View all activity
Organizations
as-cle-bert
's datasets
15
Sort:ย Recently updated
as-cle-bert/DebateLLMs
Viewer
โข
Updated
Dec 30, 2024
โข
20
โข
25
โข
4
as-cle-bert/architecture_vs_normal_image_prompts
Viewer
โข
Updated
Nov 8, 2024
โข
6k
โข
11
โข
2
as-cle-bert/speckledata
Viewer
โข
Updated
Jun 3, 2024
โข
2.43k
โข
13
as-cle-bert/saccaromyces-cerevisiae-base
Viewer
โข
Updated
Apr 16, 2024
โข
368
โข
20
โข
1
as-cle-bert/AMR-Gene-Families
Viewer
โข
Updated
Apr 1, 2024
โข
1.5k
โข
166
โข
1
as-cle-bert/scerevisiae-proteins-reduced
Viewer
โข
Updated
Apr 1, 2024
โข
600
โข
13
as-cle-bert/plastic-enzymes
Viewer
โข
Updated
Apr 1, 2024
โข
1.64k
โข
55
โข
1
as-cle-bert/scerevisiae-transcripts-biotypes
Viewer
โข
Updated
Mar 31, 2024
โข
6.72k
โข
60
โข
1
as-cle-bert/breastcancer-semantic-segmentation
Viewer
โข
Updated
Mar 31, 2024
โข
40
โข
52
as-cle-bert/banana-disease-classification
Viewer
โข
Updated
Mar 31, 2024
โข
777
โข
154
โข
2
as-cle-bert/breastcancer-auto-objdetect
Viewer
โข
Updated
Mar 30, 2024
โข
547
โข
73
โข
1
as-cle-bert/breastcancer-auto-segmentation
Viewer
โข
Updated
Mar 30, 2024
โข
547
โข
84
โข
1
as-cle-bert/breastcanc-ultrasound-class
Viewer
โข
Updated
Mar 29, 2024
โข
647
โข
100
โข
2
as-cle-bert/VirBiCla-training
Viewer
โข
Updated
Mar 20, 2024
โข
60k
โข
5
โข
1
as-cle-bert/genetics-arxiv-wiki
Viewer
โข
Updated
Mar 7, 2024
โข
23.3k
โข
26
โข
2