File size: 1,363 Bytes
aaaa4f9 2844fc7 ad71777 b9d9695 2844fc7 b9d9695 2844fc7 b9d9695 2844fc7 b9d9695 c70c6aa 2844fc7 aaaa4f9 b9d9695 a6edd9a 989f366 7492a93 4992c61 989f366 eb2b838 7bef6dc |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 |
---
library_name: tf-keras
license: apache-2.0
metrics:
- accuracy
tags:
- tabular-classification
- tensorflow
model-index:
- name: TF_Decision_Trees
results:
- task:
type: structured-data-classification
dataset:
name: Census-Income Data Set
type: census
metrics:
- type: accuracy
value: 96.57
- type: validation loss
value: 0.227394
---
# TensorFlow's Gradient Boosted Trees Model for structured data classification
Use TF's Gradient Boosted Trees model in binary classification of structured data <br />
* Build a decision forests model by specifying the input feature usage.
* Implement a custom Binary Target encoder as a Keras Preprocessing layer to encode the categorical features with respect to their target value co-occurrences, and then use the encoded features to build a decision forests model.<br />
The model is implemented using Tensorflow 7.0 or higher. The US Census Income Dataset containing approximately 300k instances with 41 numerical and categorical variables was used to train it. This is a binary classification problem to determine whether a person makes over 50k a year.<br />
Author: Khalid Salama
Adapted implementation: Tannia Dubon
Find the colab notebook at https://github.com/tdubon/TF-GB-Forest/blob/c0cf4c7e3e29d819b996cfe4eecc1f2728115e52/TFDecisionTrees_Final.ipynb
|