Заключение
643
758. Vinyals, O., Toshev, A., Bengio, S., and Erhan, D. (2015b). Show and tell: a neural
image caption generator. In CVPR’2015. arXiv:1411.4555.
759. Viola, P. and Jones, M. (2001). Robust real-time object detection. In International
Journal of Computer Vision.
760. Visin, F., Kastner, K., Cho, K., Matteucci, M., Courville, A., and Bengio, Y. (2015).
ReNet: A recurrent neural network based alternative to convolutional networks.
arXiv preprint arXiv:1505.00393.
761. Von Melchner, L., Pallas, S. L., and Sur, M. (2000). Visual behaviour mediated by
retinal projections directed to the auditory pathway. Nature, 404(6780), 871–876.
762. Wager, S., Wang, S., and Liang, P. (2013). Dropout training as adaptive regulariza-
tion. In Advances in Neural Information Processing Systems 26, pages 351–359.
763. Waibel, A., Hanazawa, T., Hinton, G. E., Shikano, K., and Lang, K. (1989). Phoneme
recognition using time-delay neural networks. IEEE Transactions on Acoustics,
Speech, and Signal Processing, 37, 328–339.
764. Wan, L., Zeiler, M., Zhang, S., LeCun, Y., and Fergus, R. (2013). Regularization of
neural networks using dropconnect. In ICML’2013.
765. Wang, S.
and Manning, C. (2013). Fast dropout training. In ICML’2013.
766. Wang, Z., Zhang, J., Feng, J., and Chen, Z. (2014a). Knowledge graph and text jointly
embedding. In Proc. EMNLP’2014.
767. Wang, Z., Zhang, J., Feng, J., and Chen, Z. (2014b). Knowledge graph embedding by
translating on hyperplanes. In Proc. AAAI’2014.
768. Warde-Farley, D., Goodfellow, I. J., Courville, A., and Bengio, Y. (2014). An empirical
analysis of dropout in piecewise linear networks. In ICLR’2014.
769. Wawrzynek, J., Asanovic, K., Kingsbury, B., Johnson, D., Beck, J., and Morgan, N.
(1996). Spert-II: A vector microprocessor system. Computer, 29(3), 79–86.
770. Weaver, L. and Tao, N. (2001). The optimal reward baseline for gradient-based rein-
forcement learning. In Proc. UAI’2001, pages 538–545.
771.
Weinberger, K. Q. and Saul, L. K. (2004). Unsupervised learning of image manifolds
by semidefinite programming. In CVPR’2004, pages 988–995.
772. Weiss, Y., Torralba, A., and Fergus, R. (2008). Spectral hashing. In NIPS, pages
1753–1760.
773. Welling, M., Zemel, R. S., and Hinton, G. E. (2002). Self supervised boosting. In
Advances in Neural Information Processing Systems, pages 665–672.
774. Welling, M., Hinton, G. E., and Osindero, S. (2003a). Learning sparse topographic
representations with products of Student t-distributions. In NIPS’2002.
775. Welling, M., Zemel, R., and Hinton, G. E. (2003b). Self-supervised boosting. In
S. Becker, S. Thrun, and K. Obermayer, editors, Advances
in Neural Information
Processing Systems 15 (NIPS’02), pages 665–672. MIT Press.
776. Welling, M., Rosen-Zvi, M., and Hinton, G. E. (2005). Exponential family harmoni-
ums with an application to information retrieval. In L. Saul, Y. Weiss, and L. Bottou,
editors, Advances in Neural Information Processing Systems 17 (NIPS’04), volume
17, Cambridge, MA. MIT Press.
777. Werbos, P. J. (1981). Applications of advances in nonlinear sensitivity analysis. In
Proceedings of the 10th IFIP Conference, 31.8–4.9, NYC, pages 762–770.
778. Weston, J., Bengio, S., and Usunier, N. (2010). Large scale image annotation: learning
to rank with joint word-image embeddings. Machine Learning, 81(1), 21–35.