Back in mid-2019, while I was working on the Data Platform team at Krom, I ran into a problem that lingered long after it was fixed. Our Celery workers would occasionally stop processing tasks without any obvious signal. Supervisor still showed them as RUNNING, with uptimes measured in days, but zero tasks were processed. Data pipelines quietly stalled, alerts never fired, and the only clue was an occasional Broken pipe buried in the logs. ...

Honored to Be an Elastic Contributor
I’m thrilled to share that I have been recognized as an Elastic Bronze Contributor for 2023. This achievement is a reflection of my commitment to the open-source community and a testament to the valuable work being done in the Elasticsearch ecosystem. My efforts have primarily focused on solving real-world challenges faced by developers in the Vietnamese market and beyond. My key areas of contribution include: Vietnamese Analysis Plugin: Ensuring robust and accurate search capabilities for complex Vietnamese language data. ...

Understanding Elasticsearch searching data flow
Search is a generic term for information retrieval. Elasticsearch provides various retrieval capabilities, including full-text searches, geo searches, range searches, scripted searches, and aggregations. How does Elasticsearch execute a search query behind the scenes? The following diagram shows data flow for searching operations. ...

Understanding Elasticsearch deleting data flow
How is a document deleted from Elasticsearch? The following diagram shows data flow behinds the scene of deleting a single document. ...

Understanding Elasticsearch indexing data flow
When a document is stored in Elasticsearch, it is indexed and fully searchable within 1 second. Elasticsearch uses an inverted index data structure that supports full-text searches efficiently and very fast. ...