Llamaindex

Thai language RAG with Llamaindex + Weaviate + SeaLLM

Matichon Maneegard

05 Jan 2024 — 2 min read

Sa-Wad-Dee, Hello from Thailand.

Introduce

RAG is like this cool AI tool designed for developers and teams who want to bring LLMs into new features. But gotta admit, there's a small hiccup - we don't really have a ton of tutorial projects and this leads to a couple of big issues.

Everything's pretty much in English. So for those who aren't native English speakers, they have to tweak their solutions and spend that extra time getting acquainted with stuff like Tokenizers or Text splitters.
Everything's pretty much geared for OpenAI's API, which is the easiest and most efficient to use. But on the flip side, this makes other AI providers and embedding models feel left out. And let's not forget those developers who, trying to save some bucks, want to use other AI and embeddings - they're hit with crazy high switching costs.

These issues end up slowing the AI adoption process and bumping up costs, especially for firms not primarily using English.

So, the idea to create use cases with examples came about. It's pretty much a way to help developers learn faster and get AI applications up and running quicker, whether to boost existing features or whip up new ones.

Challenge

Tiktoken counts each character in the Thai language as one token, whereas a local AI model could consider each word as one token.
Thai language doesn't complete sentences with a "full stop".
Integrate Huggingface Embedding and an OpenAI API-like with LlamaIndex.
Highly customize the vector search using Weaviate.

Environment

LlamaIndex (Data Framework)
Weaviate (Vector Database)
SeaLLM-7b (AI Model) Serving by Float16.cloud
intfloat/multilingual-e5-large (Embedding Model)

The notebook

Warning. I have commented the code in Thai language.

Float16 @ Techsauce Global Summit 2025

Techsauce Global Summit 2025 has concluded on August 4-6, 2025, bringing together leading tech companies from Thailand and around the world to showcase their latest innovations and breakthroughs. Float16 participated in this event for the second consecutive year, and over these three days, we had numerous engaging conversations with interested

Float16 @ Techsauce Global Summit 2025

ผ่านไปแล้วกับงาน Techsauce Global Summit 2025 ในวันที่ 4-6 สิงหาคม 2025 ซึ่งเป็นงานที่รวบรวมบริษัท Tech ชั้นนำในไทยและต่างประเทศ มาออก Showcase นำเสนอผลงานและนวัตกรรมใหม่ๆ โดย Float16 ก็ได้เข้าร่วมงานนี้เป็นปีที่ 2 ซึ่ง 3 วันที่ผ่านมาก็มีทั้งคนเข้

Typhoon-OCR-7b พร้อมใช้แล้ว !!

Typhoon-OCR-7b สามารถใช้ผ่าน AI as a Service ของ Float16 ได้แล้ววันนี้ รายละเอียด Typhoon-OCR-7b Typhoon-OCR-7b เป็น Model จากทีม Typhoon (SCB10X) โดยเป็นการต่อยอดจาก Model Qwen-2.5-vl-7b Typhoon-OCR-7b มีประสิทธิภาพ OCR ได้ดีกว่า GPT-4o และ Gemini 2.5 ซึ่งสามารถนำไปใช้ได้อย่

HiDream I1 - The best Open source for Image Gen

HiDream I1 is an open-source image generator. HiDream I1 comes with 3 variants: Full, Dev and Fast. * HiDream I1 Full is the full version of HiDream. This version uses more compute power and time to achieve higher image quality. * HiDream I1 Dev is a distilled version of the HiDream Full

Sa-Wad-Dee, Hello from Thailand.

Introduce

Challenge

Environment

The notebook

Read more

Float16 @ Techsauce Global Summit 2025

Float16 @ Techsauce Global Summit 2025

Typhoon-OCR-7b พร้อมใช้แล้ว !!

HiDream I1 - The best Open source for Image Gen