ANALOG EDGE

Silicon Temperature Sensors Protect Your System
By Uwe Kopp, DCS Marketing Manager

Two facts in building electronic systems force system designers to think more and more about the temperature budget. One chief driver is that cost and size of the PCB and enclosures have to be reduced resulting in less air flow to cool semiconductors which are both power hungry and at the same time sensitive. The other factor is the need for ever higher performance from processors, DSPs and Microcontrollers. Moore's law of doubling processor power is having the side effect that power dissipation is raising proportionally. These trends cause the temperature in enclosures to reach higher and higher values impacting reliability and even functionality of the whole design. Several strategies have evolved to ensure reliable performance of today's electronics. One key protection element is the temperature sensing device.

Thermal modeling

The first step after architecting the functional design of boards and enclosures should be thermal modeling. The resulting color plots show clearly where the bottlenecks in the design are and whether active temperature sensing and control are necessary. Modeling will also confirm if simpler passive methods are sufficient. A good example of a modeling tool, which is available for power supplies, can be found at: http://www.national.com/appinfo/webench/webtherm/.  This modeling tool was developed in cooperation with Flomerics.


Thermal simulation results for a switching regulator

After identifying the critical areas in your application a protection strategy has to be defined. There are various alternatives available to protect, enhance or ensure extended operation.

Protection strategies

Up until a few years ago copper traces or metal blocks used as heat sinks were the means to protect heat dissipating devices. This method is still fashionable if heat sink weight is no problem and especially where other techniques such as fan cooling or reduction of power supply voltage and/or processor speed are not viable. From a heat management point of view a lot can be learned from computer manufacturers. Protection mechanisms that have been used in desktop and laptop computers have found their way into other industries from game manufacturers to automotive applications. Here are the different ways of protecting an electronic system against overheating or operating it in its optimal environmental temperature:

  • Passive protection by convection
  • On/Off protection through thermostats
  • Active temperature sensor protection
  • Hardware monitoring with voltage and fan control

One important fact to note here is that silicon temperature sensors can work from -55degC to 150degC when housed in plastic packages. For more advanced applications, over 150degC and up to 180degC, the sensors can still be used in die form. For most electronic applications this method should be sufficient. Outside of this range passive sensors like negative temperature coefficient thermistors (NTCs) or thermocouples should be used.

Passive protection

The most economic way to protect electronic circuitry is good power de-rating, free air convection and heatsinking. For uncomplicated systems using minimum power hungry devices where space and weight are not an issue this method works fine. To ensure the reliability levels guaranteed by the IC manufacturers are met in your design, the junction temperature of the chips themselves must not exceed the values given in the datasheets. Normally these temperatures are around 150degC.

All power device datasheets also give formulas to calculate the size of a heat sink through power de-rating. The formula considers the thermal resistance of the different elements through which the heat dissipates, such as chip to package, package to heat sink and finally convection to air.
To optimize heat flow PCBs should be mounted horizontally with the IC's which are dissipating the greatest heat mounted on the upper side edge. At a system, or sub-system, design level these same simple rules apply, the hot device should be mounted so its heat can rise and find a route out of the system.

Silicon thermostats

Another simple and efficient method for protecting electronic devices from thermal runaway are silicon thermostats. They can be used to bring the power devices in question into standby mode, disable the power supply or start a fan. The LM26 and LM27 series of silicon temperature sensors are very cost efficient implementations of this concept. They come in various trip point options from 23 deg C up to 145 deg C to ensure flexible usage in a wide variety of applications. Also the reset point, or hysteresis, is selectable to be 2 deg C or 10 deg C to take care of the different thermal lag times and thermal masses that are inherent in electronic systems.

The sensor is paired closely with the power device and the temperature can be accurately estimated. The temperature trip point has to be evaluated empirically. As a rule of thumb the air temperature at the sensor is about 15deg lower than on the package of the monitored device. This type of measurement can only give a rough estimate but will protect efficiently. Due to the inherent inaccuracy of this measurement method the sensor itself does not have to be very accurate. Temperature accuracies between +/-2degC and +/-4degC are sufficient.

Digital temperature sensors

Digital temperature sensors integrate a diode sensing element, an analog to digital converter (ADC), registers and an interface block. Digital temperature sensors communicate with the microprocessor through digital interfaces such as SPI, SMBus or I2C. They are available with ADC resolutions from 8- 16-Bits. Higher resolution devices can be trimmed more accurately at the end of line by the manufacturer. Accuracies for an 8-Bit device are around +/-4 degC and can be as good as 0.33degC at room temperature for 16-Bit devices. The smaller the temperature range a sensor has to cover the better the accuracy for this window can be defined. This is because of the deviation of the temperature transfer function from the linear ideal due to its hyperbolic nature.

Accuracy is needed for various reasons. One such example is where system parameters change rapidly. The closer you can operate at the limits without risking reliability the better. Assume you are using an MCU that operates up to 125degC before functioning abnormally. Using a temperature sensor with +/- 4deg accuracy, an alarm to switch off the processor should be set at 117degC. A better sensor with +/-1 degC accuracy gives headroom up to 123degC.

The robust data output stream means sensors can be placed 20-30 centimeters from the MCU even in relatively noisy environments. No additional circuitry or pins on the controller are needed. Together with other digital interface devices the sensor can operate on the same bus. Analog sensors on the other hand need a dedicated Microcontroller with integrated AD converter and would have to be placed very closely to the MCU as the analog output signal that carries the temperature information is susceptible to noise.

Digital sensors such as the LM75B can handle up to 300mV noise on the bus without hanging up or delivering false data. Analog sensors typically output 10mV/degC and consequently give erroneous data if 10mV of noise is introduced. Filtering can help, but it does not eliminate this inherent problem.

Remote diode sensing

Even with a very accurate digital temperature sensor one never acquires the exact temperature of the chip being protected. Several manufacturers of 'hot' devices like FPGAs, MCUs, Video Processors or Power FETs have implemented a feature used on Intel Processors.

Through this built-in diode, the device temperature can be monitored very closely and accurately. National offers a variety of temperature sensors with the capability to measure remote diodes. Details can be found on the Thermal Management Selection Guides of National Semiconductor web site at http://www.national.com/appinfo/tempsensors/.  To accommodate many different remote diode characteristics, the user can select diode non-ideality factors and can also compensate for trace resistance. The latter is important as a trace resistance of just 1ohm can introduce a temperature error of 1degC because these sensors measure microvolt signals.

Remote diode sensing chips are available with up to three remote inputs to measure different spots in a system. The interface to all of them is SMBus from the PC world. But this should not be a stumbling clock to using them in a different application. SMBus is very similar in functionality to I2C and actually more robust because a timeout function is already implemented that prevents one user on the bus from locking the whole system.

Fan control

Rather than stopping operation when the temperature rises, many of the remote sensing devices described so far allow the control of DC fans. Depending on the application these devices may well be sufficient to manage the thermal budget and are relatively easy to implement at the system level.

Let's look at the problems that can occur when designing a system with a cooling fan. Most fan controllers are not simply on-off devices, but use Pulse Width Modulation (PWM) to control DC fan speed. Unfortunately many of the PWM outputs operate at frequencies below 20kHz resulting in audible noise when spinning up and down. To avoid noise audible by human ears the PWM switching has to take place either at very high or very low frequencies. To overcome this, devices implementing free programmable PWM switching frequencies at up to 180kHz in steps are available, such as National's remote diode sensor LM63. If resonance frequencies are encountered with either the enclosure, the fan itself or other system components is in this upper frequency band then the device also can be used with any PWM frequency between 22Hz and 703Hz to tune the noise out of the system.

In many systems the best acoustic fan noise performance is achieved when the PWM versus temperature transfer function curve is parabolic in shape. An example of how the parabola function is implemented into the lookup table is given in the datasheet of the LM63. This temperature sensor allows the user to program a transfer function individually into a lookup table to match system needs.


Example of LM63 transfer function

Summary

Reliability and functionality of any integrated circuit is very dependent on the environmental conditions. The greatest influences are electrical and thermal stresses. Electrical stress factors are normally very well understood and taken care of. Thermal stress factors are increasingly becoming an issue with higher performance devices in smaller cabinets. In the past it was sufficient to implement passive methods to get rid of the heat. However today's designs require more in depth analysis of the thermal budget and closed loop temperature control is fast becoming an important feature of electronic systems. Failing to take these issues into account can result in more than burnt fingers.

For on line simulation go to http://webench.national.com

 

Further reading:
Bar-Cohen, Avram
- "Thermal Management of Electronics", The Electrical Engineering Handbook, pp 784-797, 1993

Kordyban, Tony -  Hot Air Rises and Heat Sinks Paperback, Jul-98, ASME Press, ISBN:0791800741

'Package Thermal Characterization' (pdf 78KB) and 'Understanding Integrated Circuit Package Power Capabilities' (pdf 564KB), downloadable from National Semiconductors website.