Decision Tree, Random Forest, Gradient Boosted Trees

Last Updated: 2021-11-19

Decision Tree

Every node in a decision tree is a condition on a single feature, designed to split the dataset in two so that similar response values end up in the same subset.
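A minimal sketch of this node-as-condition view, using scikit-learn and the iris dataset purely for illustration (dataset choice, depth, and random seed are arbitrary assumptions):

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

# Small, well-known dataset used only to make the printed tree readable.
iris = load_iris()
tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(iris.data, iris.target)

# export_text prints every internal node as "feature <= threshold":
# a condition on a single feature that splits the data in two.
print(export_text(tree, feature_names=iris.feature_names))
```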

The measure used to choose the (locally) optimal condition is called impurity (illustrated in the sketch after this list):

  • classification: Gini impurity or information gain/entropy
  • regression: variance.
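A plain-NumPy sketch of the three measures for a single node, just to make the numbers concrete (this is not how scikit-learn computes them internally):

```python
import numpy as np

def gini(labels):
    # Gini impurity: 1 - sum of squared class probabilities.
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def entropy(labels):
    # Entropy (in bits): -sum(p * log2(p)).
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

print(gini([0, 0, 1, 1]))     # 0.5 for a 50/50 node; a pure node gives 0.0
print(entropy([0, 0, 1, 1]))  # 1.0 bit for a 50/50 node
print(np.var([1.0, 1.2, 5.0, 5.3]))  # regression impurity: variance of the responses
```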

Feature Importance

http://blog.datadive.net/selecting-good-features-part-iii-random-forests/

  • For a single tree: how much each feature decreases the weighted impurity across the nodes where it is used.
  • For a forest: the impurity decrease from each feature is averaged over all trees, and the features are ranked by this averaged measure (as sketched below).
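In scikit-learn this averaged impurity decrease (often called MDI, mean decrease in impurity) is exposed as feature_importances_; a short sketch, again on iris with an arbitrary seed:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

iris = load_iris()
forest = RandomForestClassifier(n_estimators=200, random_state=0).fit(iris.data, iris.target)

# feature_importances_ is the impurity decrease averaged over all trees,
# normalized so the importances sum to 1.
for name, score in sorted(zip(iris.feature_names, forest.feature_importances_),
                          key=lambda pair: pair[1], reverse=True):
    print(f"{name}: {score:.3f}")
```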

Note:

  • Feature selection based on impurity reduction is biased towards variables with more categories.
  • With correlated features, strong features can end up with low scores: if the dataset has two (or more) correlated features, then from the model's point of view any of them can serve as the predictor, with no concrete preference of one over the others. But once one of them is used, the importance of the others drops significantly, since the impurity they could have removed has already been removed by the first feature (see the sketch below).
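A minimal sketch of the correlated-feature effect, using a duplicated column as an extreme case of correlation (the exact split of importance between the copies depends on the random seed and forest settings):

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

X, y = load_iris(return_X_y=True)

# Duplicate one informative column: two features now carry identical information.
X_dup = np.hstack([X, X[:, [3]]])

forest = RandomForestClassifier(n_estimators=500, random_state=0).fit(X_dup, y)
print(forest.feature_importances_)
# The importance the original column would have received on its own is now
# shared between the two copies, so each copy looks weaker than it really is.
```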