New Development: NoiseWhaLLM

2024 | Liam Daly Manocchio

We are excited to announce a groundbreaking addition to the NoiseWhale ecosystem! NoiseWhale now features an in-house Large-Language-Model (LLM) capable of both domain-specific generative and masked modeling tasks. What sets this innovation apart is its unique capability to handle tabular and multi-variate data. Currently, the only transformers in industry that can process tabular data, leverage the IBM token trick1, which forces the quantization of real values, causing a loss of resolution and poor understanding of continuous numerical values. RIFT has developed a new tokenisation approach that prevents this loss of resolution, and stabilises training. This remarkable advancement allows NoiseWhale to leverage transformers for modeling datasets that were previously inaccessible to ChatGPT and other LLMs.

Revolutionizing Data Modeling with NoiseWhale's LLM

At NoiseWhale, we are dedicated to pushing the boundaries of natural language processing and machine learning. Our in-house Large-Language-Model is a testament to this commitment. Unlike conventional LLMs, our model is engineered to excel in domain-specific generative tasks and masked modeling.

However, what truly sets our LLM apart is its capability to work with tabular and multi-variate data without relying on the IBM token trick. This technical breakthrough eliminates the limitations that often hinder other models, making it a game-changer for industries dealing with complex datasets.

Unlocking New Possibilities

By enabling the ingestion of tabular data, NoiseWhale's LLM opens up new avenues for data analysis and interpretation, particularly in scientific domains. Whether you're dealing with intricate sensor logs, multi-dimensional datasets, or any other form of structured information, our LLM empowers you to harness the full potential of transformer models.

Imagine effortlessly processing sensor data from various sources, making sense of complex patterns, and generating insightful reports with ease. NoiseWhale's LLM is designed to make this a reality, offering unprecedented capabilities for data-driven decision-making. You're no longer confined to natural langauge or 1D signals.

Experience the Future of Data Modeling

We invite you to join us in exploring the possibilities that NoiseWhale's in-house LLM brings to the table, on your datasets and applications. As we continue to refine and expand the capabilities of our model, you can be at the forefront of transformative advancements in data modeling and analysis.

Get in touch with our team today to learn more about how NoiseWhale's LLM can empower your organization to unlock the true potential of your data. Together, we can revolutionize the way you approach data-driven challenges and drive innovation in your industry.

Reference:
[1] - IBM token trick reference

Machine Learning
Recent Articles: