Llamaindex

Thai language RAG with Llamaindex + Weaviate + SeaLLM

Matichon Maneegard

05 Jan 2024 — 2 min read

Sa-Wad-Dee, Hello from Thailand.

Introduce

RAG is like this cool AI tool designed for developers and teams who want to bring LLMs into new features. But gotta admit, there's a small hiccup - we don't really have a ton of tutorial projects and this leads to a couple of big issues.

Everything's pretty much in English. So for those who aren't native English speakers, they have to tweak their solutions and spend that extra time getting acquainted with stuff like Tokenizers or Text splitters.
Everything's pretty much geared for OpenAI's API, which is the easiest and most efficient to use. But on the flip side, this makes other AI providers and embedding models feel left out. And let's not forget those developers who, trying to save some bucks, want to use other AI and embeddings - they're hit with crazy high switching costs.

These issues end up slowing the AI adoption process and bumping up costs, especially for firms not primarily using English.

So, the idea to create use cases with examples came about. It's pretty much a way to help developers learn faster and get AI applications up and running quicker, whether to boost existing features or whip up new ones.

Challenge

Tiktoken counts each character in the Thai language as one token, whereas a local AI model could consider each word as one token.
Thai language doesn't complete sentences with a "full stop".
Integrate Huggingface Embedding and an OpenAI API-like with LlamaIndex.
Highly customize the vector search using Weaviate.

Environment

LlamaIndex (Data Framework)
Weaviate (Vector Database)
SeaLLM-7b (AI Model) Serving by Float16.cloud
intfloat/multilingual-e5-large (Embedding Model)

The notebook

Warning. I have commented the code in Thai language.

Read more

AI Bootcamp: LLM Finetuning & Deployment

AI Bootcamp: LLM Finetuning & Deployment

On Friday, July 4th, 2025, Float16 in collaboration with the Typhoon SCB 10X team organized the AI Bootcamp: LLM Finetuning & Deployment at DistrictX, FYI Building. This event marked a significant milestone in promoting AI technology development in Thailand. The bootcamp received overwhelming interest and was successfully completed beyond expectations.

AI Bootcamp: LLM Finetuning & Deployment

AI Bootcamp: LLM Finetuning & Deployment

เมื่อวันศุกร์ที่ 4 กรกฎาคม 2025 ที่ผ่านมา Float16 ร่วมกับทีม Typhoon SCB 10X จัดงาน AI Bootcamp: LLM Finetuning & Deployment ขึ้นที่ DistrictX ตึก FYI ซึ่งถือเป็นก้าวสำคัญในการส่งเสริมการพัฒนาเทคโนโลยี AI ในประเทศไทย งานนี้ได้รับความสนใจอย่างล้นหลาม

LLM Arena: No More Guessing Games When Choosing AI Models

LLM Arena: No More Guessing Games When Choosing AI Models

หลายคนคงเจอปัญหาเดียวกับเรา ตอนที่ต้องเลือก LLM model มาใช้งาน ไม่รู้ว่าควรเลือก model ไหนดี อ่านสเปคก็ดูเหมือนจะดีทุกตัว แต่พอไปใช้งานจริงไม่ตอบโจทย์งานนั้น ๆ เลยคิดว่าทำไมเราไม่สร้างตัวช่วยขึ้นมาล่ะ เอาโมเดลหลายๆ ตัวมาเปรียบเทียบกันแบบเห็

GPU monitoring dashboard

GPU monitoring dashboard

บทความนี้ผมจะพาทุกคนมาเรียนรู้การทำ monitoring dashboard ของ GPU ด้วย grafana กันนะครับ โดยจะเริ่มกันตั้งแต่วิธีการติดตั้ง grafana จนไปถึงการตั้งค่าให้รับค่าการทำงานจาก gpu โดยใช้ dcgm-exporter ผ่าน prometheous จนสามารถสร้างเป็น dashboard ที่ดูการทำงานต่างๆของ GPU ได้ และทั้งหมดเราจะทำการ