analog biological weight representationsziyang.eecs.umich.edu/iesr/lectures/l15-2x2.pdf ·...

Introduction to Embedded Systems Research: Weight precision alternatives

Robert Dick

[email protected]

Department of Electrical Engineering and Computer Science

University of Michigan

[Figures omitted. Surviving axis labels: Power (mW), Time (s), Position (mm), Temperature (°C).]

Glia

Remember when neurons were the only nervous system cells to signal?

Astrocytes also signal.

May be the proximal cause of fMRI blood flow changes.

2 R. Dick EECS 598-13

Digital biological weight representations

R. Wessel, C. Koch, and F. Gabbiani, “Coding of time-varying electric field amplitude modulations in a wave-type electric fish,” J. Neurophysiology, vol. 75, no. 6, June 1996.


Analog biological weight representations

D. Debanne, A. Bialowas, and S. Rama, “What are the mechanisms for analogue and digital signalling in the brain?” Nature Reviews Neuroscience, vol. 14, pp. 63–69, Jan. 2013.


Conventional machine learning weight representation

$\text{sign} \cdot \text{significand} \cdot 2^{\text{exponent}}$

Single-precision: 24-bit significand (23 stored bits plus an implicit leading 1), 8-bit exponent.

Double-precision: 53-bit significand (52 stored bits plus an implicit leading 1), 11-bit exponent.

(Unnecessarily) large dynamic range and precision.
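The sign, significand, and exponent fields can be inspected directly. The following is a minimal sketch, assuming IEEE 754 single precision and handling only normal numbers; the function name `decompose_float32` is hypothetical and introduced here for illustration.

```python
import struct

def decompose_float32(x):
    """Split x (stored as IEEE 754 single precision) into
    (sign, significand, exponent) with
    value == sign * significand * 2**exponent for normal numbers."""
    # Reinterpret the 4 bytes of the float as an unsigned 32-bit integer.
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    sign = -1 if bits >> 31 else 1
    biased_exponent = (bits >> 23) & 0xFF   # 8-bit exponent field
    fraction = bits & 0x7FFFFF              # 23 stored significand bits
    if biased_exponent in (0, 0xFF):
        # Zero, subnormals, infinities, and NaN are left out of this sketch.
        raise ValueError("only normal numbers are handled")
    significand = 1 + fraction / 2**23      # restore the implicit leading 1
    exponent = biased_exponent - 127        # remove the exponent bias
    return sign, significand, exponent

print(decompose_float32(-6.5))  # (-1, 1.625, 2), since -6.5 = -1.625 * 2**2
```

The 8-bit exponent field alone spans scale factors from roughly 2^-126 to 2^127 for normal numbers, which illustrates why the dynamic range is larger than most trained weights need.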


Reduced-precision floating point

16-bit (half-precision) common.

11-bit significand (10 stored bits plus an implicit leading 1), 5-bit exponent.

Often reduces accuracy by a few percent or less.
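The rounding error introduced by half precision can be measured per weight. This is a minimal sketch using the standard library's `struct` format code `"e"` (IEEE 754 binary16); the helper name `to_half` and the sample weights are illustrative assumptions, not values from the lecture.

```python
import struct

def to_half(x):
    """Round x to the nearest IEEE 754 half-precision value,
    returned as an ordinary Python float."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

# Hypothetical weight values chosen only for illustration.
weights = [0.1, -0.3337, 1.0, 0.00012345]
for w in weights:
    h = to_half(w)
    rel_err = abs(h - w) / abs(w)
    print(f"{w!r} -> {h!r}  relative error {rel_err:.2e}")
```

For values in the normal half-precision range, the relative rounding error is bounded by 2^-11 (about 0.05%), a consequence of the 11-bit significand. How that per-weight error translates into network accuracy depends on the model and is not shown by this sketch.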


Fixed point

Integer, with an implied radix-point position maintained by the programmer or compiler.

Higher potential efficiency than floating point.

Harder to deal with in practice for programmer or compiler.

Must determine maximum value at each stage in a computation DAG and use appropriate implied scale.

Integer is a degenerate case of fixed point.
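The bookkeeping described above can be sketched as follows. This is an illustrative example only: the 8 fractional bits, the 16-bit word size, and all function names are assumptions made here, not choices stated in the lecture.

```python
import math

FRAC_BITS = 8  # assumed scale: stored integer = value * 2**8

def to_fixed(x, frac_bits=FRAC_BITS):
    """Convert a real value to its fixed-point integer representation."""
    return round(x * (1 << frac_bits))

def to_float(q, frac_bits=FRAC_BITS):
    """Convert a fixed-point integer back to a real value."""
    return q / (1 << frac_bits)

def fixed_mul(a, b, frac_bits=FRAC_BITS):
    """Multiply two fixed-point values that share the same scale.
    The raw product carries 2 * frac_bits fractional bits, so shift back.
    Python's >> rounds toward negative infinity."""
    return (a * b) >> frac_bits

def frac_bits_for(max_abs, word_bits=16):
    """Pick a fractional-bit count that leaves enough integer bits for
    max_abs in a signed word. Values within half a step of the limit
    may still need saturation after rounding."""
    int_bits = max(0, math.floor(math.log2(max_abs)) + 1)
    return word_bits - 1 - int_bits

w, x = to_fixed(0.75), to_fixed(1.5)   # 192 and 384
print(to_float(fixed_mul(w, x)))       # 1.125
print(frac_bits_for(1.5))              # 14
```

The `frac_bits_for` helper stands in for the per-stage analysis: once the maximum magnitude at a node of the computation is known, the implied scale for that node follows from the word size.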


Logarithmic

V. Sze, Y.-H. Chen, T.-J. Yang, and J. Emer, “Efficient processing of deep neural networks: A tutorial and survey,” Proc. IEEE, vol. 105, no. 12, Dec. 2017.


Binary

Degenerate case of integer.

Generally results in multi-digit accuracy reduction.

The fact that it works reasonably may be surprising.

Can use structural members to represent non-binary numbers.

This is inefficient compared to conventional number representations.

E.g., $1 + 1 + 1$ vs. $1 \cdot 2^0 + 1 \cdot 2^1 + 1 \cdot 2^2$.
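The inefficiency in the example above can be made concrete by enumerating the values each scheme can represent. This sketch assumes that "structural members" means several binary (0/1) weights whose contributions are summed with equal place value; that reading, and the helper name `representable`, are assumptions made here.

```python
from itertools import product

def representable(n, place_values):
    """Set of sums reachable with n binary (0/1) weights,
    each scaled by the corresponding place value."""
    return {
        sum(bit * place for bit, place in zip(bits, place_values))
        for bits in product((0, 1), repeat=n)
    }

n = 3
unary = representable(n, [1] * n)                          # 1 + 1 + 1
positional = representable(n, [2**i for i in range(n)])    # 2^0, 2^1, 2^2

print(sorted(unary))       # [0, 1, 2, 3]
print(sorted(positional))  # [0, 1, 2, 3, 4, 5, 6, 7]
```

With n equally weighted binary elements only n + 1 distinct values are reachable, whereas n positional bits reach 2^n.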


Other encodings

Hinted at by weight compression research.

E.g., use indexed table of most common weights.

Other encodings possible.
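An indexed table of common weights might look like the following minimal sketch. The nearest-entry mapping for weights that are not in the table, the function names, and the sample data are all assumptions made for illustration; the lecture does not specify a particular scheme.

```python
from collections import Counter

def build_table(weights, table_size):
    """Return the table_size most common weight values."""
    return [value for value, _ in Counter(weights).most_common(table_size)]

def encode(weights, table):
    """Replace each weight with the index of the nearest table entry."""
    return [
        min(range(len(table)), key=lambda i: abs(table[i] - w))
        for w in weights
    ]

def decode(indices, table):
    """Look each index up in the table to recover approximate weights."""
    return [table[i] for i in indices]

# Hypothetical weights chosen only for illustration.
weights = [0.5, 0.5, -0.25, 0.5, 0.0, -0.25, 0.0, 0.5, 0.45, -0.3]
table = build_table(weights, 3)
indices = encode(weights, table)

print(table)                   # [0.5, -0.25, 0.0]
print(indices)                 # [0, 0, 1, 0, 2, 1, 2, 0, 0, 1]
print(decode(indices, table))
```

Each stored weight then needs only enough bits to index the table (2 bits for the 3-entry table here), plus the one-time cost of the table itself.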

