latest
Apr
26
![How to Beat Proprietary LLMs With Smaller Open Source Models](/content/images/size/w750/2024/04/dvg_e.png)
How to Beat Proprietary LLMs With Smaller Open Source Models
Building your AI applications around open source models can make them better, cheaper, and faster
14 min read
Apr
08
![A Guide to Structured Generation Using Constrained Decoding](/content/images/size/w750/2024/04/constrain.webp)
A Guide to Structured Generation Using Constrained Decoding
The how, why, power, and pitfalls of constraining generative language model outputs
14 min read
Jul
22
![Modern Data Engineering and the Lost Art of Data Modelling](/content/images/size/w750/2023/07/Screenshot-2023-07-17-at-08.54.20.png)
Modern Data Engineering and the Lost Art of Data Modelling
Necessity was the mother of invention. Now, an abundance of cheap storage and compute makes for data anarchy.
5 min read
Jun
23
![Machine Learning in the Life Sciences Has a Data Problem](/content/images/size/w750/2023/06/aa.png)
Machine Learning in the Life Sciences Has a Data Problem
In a time of AI prosperity, the life sciences are at risk of being left behind
6 min read
Jun
07
![Approximating Shapley Values for Machine Learning](/content/images/size/w750/2024/04/approx.webp)
Approximating Shapley Values for Machine Learning
The how and why of Shapley value approximation, explained in code
6 min read
Apr
07
![Homogeneous neighbourhoods in the Schelling model of segregation](/content/images/size/w750/2023/04/gnillehcs.jpeg)
Gnillehcs' Model of Integration
What happens to segregated communities as people increasingly seek diversity?
3 min read
Dec
31
![A power set of feature coalitions.](/content/images/size/w750/2022/12/power_set_A-3.png)
How Shapley Values Work
In this article, we explore how Shapley values work, not through cryptic formulae but by way of code and simplified explanations
10 min read
Aug
13
![Industry Perspective: Tree-Based Models vs Deep Learning for Tabular Data](/content/images/size/w750/2024/04/trees.webp)
Industry Perspective: Tree-Based Models vs Deep Learning for Tabular Data
Tree-based models aren't just highly performant - they offer a host of other advantages
3 min read
Jul
11
![4 Pandas Anti-Patterns to Avoid and How to Fix Them](/content/images/size/w750/2022/07/pandas-red-dark.png)
4 Pandas Anti-Patterns to Avoid and How to Fix Them
pandas is a powerful data analysis library with a rich API that offers multiple ways to perform any given data
9 min read
May
16
![Supervised Clustering: How to Use SHAP Values for Better Cluster Analysis](/content/images/size/w750/2022/05/feature_image-2.png)
Supervised Clustering: How to Use SHAP Values for Better Cluster Analysis
Cluster analysis is a popular method for identifying subgroups within a population, but the results are often challenging to interpret
9 min read