Spaces:

zaidmehdi
/

manga-colorizer

Running

App Files Files Community

manga-colorizer / README.md

zaidmehdi

Upload README.md with huggingface_hub

a5eb2ea verified 7 months ago

preview code

raw

history blame

1.96 kB

	---
	title: MangaColorizer
	emoji: 🖌️🎨
	colorFrom: green
	colorTo: gray
	sdk: gradio
	app_file: src/main.py
	pinned: false
	license: mit
	---

	# MangaColorizer
	This project is a colorizer of grayscale images, and in particular for manga, comics or drawings.
	Given a black and white (grayscale) image, the model produces a colorized version of it.

	[Link to the Demo](https://huggingface.co/spaces/zaidmehdi/manga-colorizer)

	![Demo App](docs/images/demo_screenshot.png "Demo App")


	## How I built this project:
	The data I used to train this model contains 755 colored images from some chapters of Bleach, Dragon Ball Super, Naruto, One Piece and Attack on Titan. I also used 215 other images for the validation set, as well as 109 other images for the test set.

	In the current version, I trained an encoder-decoder model from scratch with the following architecture:
	```
	MangaColorizer(
	(encoder): Sequential(
	(0): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
	(1): ReLU(inplace=True)
	(2): Conv2d(64, 128, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1))
	(3): ReLU(inplace=True)
	(4): Conv2d(128, 256, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1))
	(5): ReLU(inplace=True)
	)
	(decoder): Sequential(
	(0): ConvTranspose2d(256, 128, kernel_size=(4, 4), stride=(2, 2), padding=(1, 1))
	(1): ReLU(inplace=True)
	(2): ConvTranspose2d(128, 64, kernel_size=(4, 4), stride=(2, 2), padding=(1, 1))
	(3): ReLU(inplace=True)
	(4): ConvTranspose2d(64, 3, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
	(5): Tanh()
	)
	)
	```
	The inputs to the model are the grayscale version of the manga images, and the target is the colored version of the image.
	The loss function used is the MSE of all the pixel values produced by the model (compared to the target pixel values).
	Currently, it achieves an MSE of 0.00859 on the test set.

	For more details, you can refer to the docs directory.