Upload ChatGLM-6B-INT4-QE
Update README.md
Slim embedding
Drop icetk dependency
Fix attention score on mps
Upload pytorch_model.bin
Fix parallel cpu kernel