What is AIWiki Malaysia?

AIWiki Malaysia is a free, open AI knowledge base covering artificial intelligence concepts, tools, models, and use cases — written specifically for Malaysian professionals and students. It is maintained by AITG Sdn Bhd, an AI company based in Penang.

Who maintains AIWiki Malaysia?

AIWiki Malaysia is maintained by AITG Sdn Bhd (Registration: 202601016521 (1678618-W)), an AI company headquartered in George Town, Penang, Malaysia. The editorial team continuously updates and expands the knowledge base.

What topics does AIWiki Malaysia cover?

AIWiki Malaysia covers a wide range of AI topics including large language models (LLMs), AI agents, machine learning fundamentals, prompt engineering, AI automation, generative AI tools, Malaysian AI regulations, local vendor landscape, and real-world AI use cases relevant to the Malaysian market.

How do I search for AI topics on AIWiki Malaysia?

You can use the search bar at the top of the site to find articles by keyword or topic. Articles are also organised by category, so you can browse by subject area such as Models, Tools, Concepts, or Use Cases.

Is AIWiki Malaysia available in Bahasa Malaysia?

Yes. AIWiki Malaysia publishes content in both English and Bahasa Malaysia to serve the full breadth of the Malaysian professional and student community. Language availability is indicated on each article page.

How can I submit a topic or suggest an article?

You can suggest topics or submit article ideas by contacting the AIWiki Malaysia team at admin@aiteragrid.com. AITG Sdn Bhd reviews all submissions and publishes content that meets editorial accuracy standards.

Falcon LLM

A family of open-weight large language models developed by the Technology Innovation Institute (TII) in Abu Dhabi, released under permissive licenses and used widely across enterprise and research applications.

6 min readLast updated June 2026Models

Falcon LLM is a family of open-weight large language models developed by the Technology Innovation Institute (TII), the applied research arm of the Advanced Technology Research Council in Abu Dhabi, United Arab Emirates. First released in 2023, Falcon was the first major foundation-model release from outside the United States, China, and Europe to top the Hugging Face open LLM leaderboard, and it played a significant role in establishing the practice of releasing strong open-weight models under permissive commercial licenses. The Falcon series has expanded to include base, instruction-tuned, and small-form-factor variants suitable for on-device and edge deployments.

Model family

The first Falcon release in mid-2023 introduced Falcon 7B and Falcon 40B, two autoregressive decoder-only transformer models trained on RefinedWeb, a large filtered corpus derived from CommonCrawl, supplemented by curated text. Falcon 40B briefly held the top position on the Hugging Face Open LLM Leaderboard, surpassing contemporaneous open models from Meta and academic groups. In September 2023, TII released Falcon 180B, a 180-billion-parameter model trained on roughly 3.5 trillion tokens that achieved performance comparable to PaLM 2 Large and approached the capabilities of contemporary closed models such as GPT-3.5 on standard benchmarks.

Subsequent generations have focused on efficiency. Falcon 2 introduced multimodal capabilities and competitive 11-billion-parameter models, while the Falcon 3 series, released in late 2024, emphasised smaller, more efficient models in the 1-billion to 10-billion parameter range optimised for edge inference and resource-constrained deployment. The lineage is sometimes grouped under the broader Falcon Perception programme, which extends the family to multimodal and on-device perception tasks.

Architecture

Falcon models follow the standard decoder-only transformer architecture with several efficiency-oriented modifications. Multi-query attention reduces memory bandwidth requirements during inference by sharing key and value projections across attention heads. Rotary positional embeddings and FlashAttention enable efficient long-context training. The training pipeline relied heavily on data quality engineering: RefinedWeb demonstrated that aggressively filtered and deduplicated web data could match or outperform curated corpora such as The Pile, a finding that influenced the broader open-weight community.

Licensing and ecosystem

A defining feature of Falcon is its Apache 2.0 license, which permits unrestricted commercial use without revenue caps or other constraints. This contrasts with the more restrictive community licences attached to some peer open-weight families and has made Falcon attractive to enterprise users requiring legal clarity. Falcon weights are distributed primarily through Hugging Face and have been integrated into popular inference stacks including vLLM, Text Generation Inference, llama.cpp, and Ollama. Cloud providers including AWS and Microsoft Azure have made Falcon variants available through their managed model catalogues.

Applications

Falcon models are used across general-purpose chatbots, code assistants, retrieval-augmented generation pipelines, document summarisation, and domain-specific fine-tunes. Their permissive licence has made them particularly common in regulated industries — finance, healthcare, and government — where on-premises deployment and clear commercial terms are required. The smaller Falcon 3 models have been deployed in edge and embedded scenarios, including industrial inspection, robotics, and devices with limited compute, where 180-billion-parameter models would be impractical.

Comparison to peer models

| Model family | Origin | Largest size | Licence | Notable feature | |---|---|---|---|---| | Falcon | TII (UAE) | 180B | Apache 2.0 | First major non-US/CN/EU release; permissive licence | | Llama | Meta AI (USA) | 405B+ | Llama Community Licence | Largest open ecosystem | | Mistral | Mistral AI (France) | 8x22B (Mixtral) | Apache 2.0 / commercial | Mixture-of-experts variants | | Qwen | Alibaba (China) | 72B+ | Apache 2.0 (some variants) | Strong multilingual coverage |

Significance

Falcon's release was strategically important for several reasons. It established the UAE as a major contributor to open foundation models, demonstrated that high-quality web data engineering could rival curated corpora, and helped normalise the release of large frontier-scale models under permissive licences. It also catalysed regional investment in AI infrastructure and talent in the Gulf region and inspired peer initiatives across the Middle East and South-East Asia.

Malaysian Context — Falcon Adoption and Regional Open Models

Falcon has been adopted in Malaysia primarily through Hugging Face and via cloud providers such as Amazon Bedrock, Google Vertex AI, and Microsoft Azure AI, all of which have Malaysian customers via local hyperscaler regions or distribution partners. Malaysian system integrators, including those active in MDEC's Digital Hub ecosystem in Cyberjaya and TechCity Kuala Lumpur, have used Falcon variants for proof-of-concept retrieval-augmented generation deployments in regulated sectors where the Apache 2.0 licence simplifies legal review.

Government and government-linked entities in Malaysia — including ministries supported by the National AI Office, MAMPU, and MDEC — have shown interest in open-weight models as a path to sovereign AI capability, with Falcon, Llama, Mistral, and Qwen evaluated alongside any future locally trained models. The MyDigital Blueprint and Malaysia AI Roadmap both emphasise local data residency and capability building, and the Personal Data Protection Act (PDPA) constrains how training and inference data may be processed, factors that often favour on-premises or sovereign-cloud deployments of open-weight families like Falcon.

In financial services, Bank Negara Malaysia's Risk Management in Technology framework requires institutions to document and validate AI models in production, and the Securities Commission Malaysia maintains complementary guidance for capital market participants. Banks including Maybank, CIMB, RHB, and Public Bank, together with insurers and takaful operators, have piloted open-weight LLMs including the Falcon family for internal knowledge assistants, customer service summarisation, and document processing, typically deployed within their own data centres or in restricted cloud tenancies to meet PDPA and supervisory expectations.

In academia, Falcon has been used in Bahasa Melayu research by groups at Universiti Malaya, USM, UKM, and UTM, often as a base for instruction tuning and continued pre-training on Malay-language corpora, complementing similar efforts using Llama and other open families. The Malaysia AI Governance Framework treats open-weight model deployment as one of several legitimate strategies, alongside the use of frontier APIs from OpenAI, Anthropic, and Google.

References

Technology Innovation Institute. (2023). Falcon 180B: World's Most Powerful Open LLM. tii.ae.
Penedo, G., et al. (2023). The RefinedWeb Dataset for Falcon LLM. arXiv.
Almazrouei, E., et al. (2023). Falcon Series of Open Language Models. TII technical report.
Hugging Face. (2024). Open LLM Leaderboard. huggingface.co.

Tags:falcon large-language-model open-source tii uae

Type	Family of open-weight large language models
Developed by	Technology Innovation Institute (TII), Abu Dhabi
First released	2023 (Falcon 7B and 40B)
Licence	Apache 2.0
Largest size	Falcon 180B (180 billion parameters)
Related	Llama, Mistral, Hugging Face

Model family

Architecture

Licensing and ecosystem

Applications

Comparison to peer models

Significance

See Also

References

References